• « 



1 ATGGAGCGCA CAGCAGGCAA AGAGCTQGCC CTGGCACCGC TGCAGGACTG 
51 GGGTGAAGAG ACCGAGGACG GCGCGGTGTA CAGTGTCTCC CTGCGGOGGC 
101 AGCGCAGTCA GCGCAGGAGC CCGGCGGAGG GCCCCQGGGG CAGCCAGGCT 
151. CCCAGCCCCA TTGCCAATAC CTTCCTCCAC TATOGAACCA GCAAGGTCAG 
201 GGTGCTGAGG GCAGCGCGCC TGGAGCGGCT GGTGGGAGAG TTGGTGTTTG 
251 GAGACCGTGA GCAGGACCCC AGCTTCATGC CCGCCTTCCT GGCCACCTAC 
301 CGGACCTTTG TACCCACTGC CTGCCTGCTG GGCTTTCTGC TGCCACCAAT 
351 GCCACCGCCC CCACCTCCCG GGGTAGAGAT CAAGAAGACA GCGGTACAAG 
401 ATCTGAGCTT CAACAAGAAC CTCAGGGCTG TGGTGTCAGT GCTGGGCTCC 
451 TGGCTGCAGG ACCACCCTCA GGATTTCCGA GACCACCCTG TCCATTCGGA 
501 CCTQGGCAGT GTCOGAACCT TTCTCGGCTG GGC3GGCCCCA GGGAGTGCTG 
551 AGGCTCAAAA AGCAGAGAAG CTTCTGGAAG Al 1 1 1 I IGGA GGAGGCTGAG 
601 CGAGAGCAGG AAGAGGAGCC GCCTCAGGTG TGGACAGGAC CTCCCAGAGT 
651 TGCCCAAACT TCTGACCCAG ACTCTTCAGA GGCCTGCGCG GAGGAAGAGG 
701 AAGGGCTCAT GCCTCAAGGT CCCCAGCTCC TGGACTTCAG CGTGGACGAG 
751 GTGGCCGAGC AGCTCACCCT CATAGACTTG GAGCTCTTCT CCAAGGTGAG 
801 GCTCTACGAG TGCTTGGGCT CCGTGTGGTC GCAGAGGGAC CGGCCGGGGG 
851 CTGCAGGCGC CTCCCCCACT GTGCGCGCCA CCGTGGCCCA GTTCAACACC 
901 GTGACCGGCT GTGTCCTGGG TTCCGTGCTC GGAGCACCGG GCTTGGCCGC 
951 CCCGCAGAGG GCGCAGGGGC TGGAGAAGTG GATCCGCATC GCCCAGGGCT 
1001 GCCGAGAACT GCGGAACTTC TCCTCCTTCC GCGCCATCCT GTCCGCCCTG 
1051 CAATCTAACC CCATCTACCG GCTCAAGCGC AGCTGGGGGG CAGTGAGCCG 
1101 GGAACCGCTA TCTACTTTCA GGAAACTTTC GCAGATTTTC TCCGATGAGA 
1151 ACAACCACCT CAGCAGCAGA GAGATTCTTT TCCAGGAGGA GGCCACTCAG 
1201 GGATCCCAAG AAGAGGACAA CACCCCAGGC AGCCTGCCCT CAAAACCACC 
1251 CCCAGGCCCT GTCCCCTACC TTGGCACCTT CCTTACGGAC CTGGTTATGC 
1301 TGGACACAGC CCTCCCGGAT ATGTTGGAGG GGGATCTCAT TAACTTTCAG 
1351 AAGAGGAGGA AGGAGTGGGA GATCCTGGCC CGCATCCAGC AGCTGCAGAG 
1401 GCGCTGTCAG AGCTACACCC TGAGCCCCCA CCCGCCCATC CTGGCTGCCC 
1451 TGCATGCCCA GAACCAGCTC ACCGAGGAGC AGAGCTACCG GCTCTCCCGG 
1501 GTCATTGAGC CACCAGCTGC CTCCTGCCCC AGCTCCCCAC GCATCCGACG 
1551 GCGGATCAGC CTCACCAAGC GTCTCAGTGC GAAGCTTGCC CGAGAGAAAA 
1601 GCTCATCACC TAGTGGGAGT CCCGGGGACC CCTCATCCCC CACCTCCAGT 
1651 GTGTCCCCAG GGTCACCCCC CTCAAGTCCT AGAAGCAGAG ATGCTCCTGC 
1701 TGGCAGTCCC CCGGCCTCTC CAGGGCCCCA GGGCCCCAGC ACCAAGCTGC 
1751 CCCTGAGCCT GGACCTGCCC AGCCCCCGGC CCTTCGCTTT GCCTCTGGGC 
1801 AGCCCTCGAA TCCCCCTCCC GGCGCAGCAG AGCTCGGAGG CCCGTGTCAT 
1851 CCGCGTCAGC ATCGACAATG ACCAGGGGAA CCTGTATCGA AGCATCTTGC 
1901 TGACCAGTCA GGACAAAGCC CCCAGCGTGG TCCGGCGAGC CTTGCAGAAG 
1951 CACAATGTGC CCCAGCCCTG GGCCTGTGAC TATCAGCTCT TTCAAGTCCT 
2001 TCCTGGGGAC CGGGTGCTCC TGATTCCTGA CAATGCCAAC GTCTTCTATG 
2051 CCATGAGTCC AGTCGCCCCC AGAGACTTCA TGCTGCGGCG GAAAGAGGGG 
2101 ACCCGGAACA CTCTGTCTGT CTCCCCAAGC TGA (SEQ ID NO:l) 



FEATURES: 
Start Codon: 1 
stop Godon: 2131 
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Homologous proteins: 
TOP 10 BLAST Hits 



CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 



8650435 /def=gb|AAF78208.1 
7682471 /def=gb|AAF67280.1 
4589562 /def=dbi | BAA76803 . 1 
10185686 /def=gb|AAGl4400.1 



160000003781983 /altid=gi 
103000001517213 /altid=qi 
18000005222968 /altid=g? 
38000046964621 /altid=gi 
18000004932640 /altid=gi 
18000005115144 /altid=gi 

120000015982361 /altid=qi 19931302 /def=gbjAAG02122.1 
18000004933686 /altid=gi 16677735 /def=ref |NP_033084. 
18000005176024 /altid=gi 13811378 /def=gb|AAC69894.1| 
18000005033177 /altid=gi 16677737 /def=ref |NP_033085. 



BLAST dbEST hits: 



91 
91 
91 
91 

gi 

91 
91 
91 



12067555 /dataset=dbest /taxon=96. . 
11311915 /dataset=dbest /taxon=96. . 
12067611 /dataset=dbest /taxon=96. . 
843816 /dataset=dbest /taxon=9606 / 
3331474 /dataset=dbest /taxon=9606 
6660347 /dataset=dbest /taxon=9606 
2322877 /dataset=dbest /taxon=9606 
2348656 /dataset=dbest /taxon=9606 
4189370 /dataset=dbest /taxon=9606 
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EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 



91 
91 
91 
91 
9i 

gi 
gi 

91 

gi 



12067555 bocio- tumor 
11311915 Head-neck 
12067611 bocio- tumor 
843816 Breast 
3331474 Parathyroid tumor 
6660347 Uterus papillary carcinoma 
2322877 Colon tumor 
2348656 Colon tumor 

4189370 Pooled human melanocyte, fetal heart, and pregnant uterus 
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1 MERTAGKELA LAPLQDWGEE TEDGAVYSVS LRRQRSQRRS PAEGPGGSQA 
51 PSPIANTFLH YRTSKVRVLR AARLERLVGE LVFGDREQDP SFMPAFLATY 
101 RTFVPTACLL GFLLPPMPPP PPPGVEIKKT AVQDLSFNKN LRAWSVLGS 
151 WLQDHPQDFR DHPVHSDLGS VRTFLGWAAP GSAEAQKAEK LLEDFLEEAE 
201 REQEEEPPQV WTGPPRVAQT SDPDSSEACA EEEEGLMPQG PQLLDFSVDE 
251 VAEQLTLIDL ELFSKVRLYE CLGSVWSQRD RPGAAGASPT VRATVAQFNT 
301 VTCCVLGSVL GAPGLAAPQR AQRLEKWIRI AQRCRELRNF SSLRAILSAL 
351 QSNPIYRLKR SWGAVSREPL STFRKLSQIF SDENNHLSSR EILFQEEATE 
401 GSQEEDNTPG SLPSKPPPGP VPYLGTFLTD LVMLDTALPD MLEGDLINFE 
451 KRRKEWEILA RIQQLQRRCQ SYTLSPHPPI LAALHAQNQL TEEQSYRLSR 
501 VEEPPAASCP SSPRIRRRIS LTKRLSAKLA REKSSSPSGS PGDPSSPT5S 
551 VSPGSPPSSP RSRDAPAGSP PASPGPQGPS TKLPLSLDLP SPRPFALPLG 
601 SPRIPLPAQQ SSEARVIRVS IDNDHGNLYR SILLT5QPKA PSWRRALQK 
651 HNVPQPWACD YQLFQVLPGD RVLLIPDNAN VFYAMSPVAP RDFMLRRKEG 
701 TRNTLSVSPS (SEQ ID NO: 2) 

FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASNJGLYCOSYLATTON 
N-glycosylation site 

339-342 NFSS 

[2] PDOC00004 PS00004 CAMP_PHOSPHO_STTE 

camp- and cGMP-dependent protein kinase phosphorylation site 

Number of matches: 3 

1 374-377 RKLS 

2 517-520 RRIS 

3 523-526 KRLS 

[3] PDOC00005 PS00005 PKCLPHOSPHOJSTJE 
Protein kinase C phosphorylation site 



Number of matches: 


18 
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30-32 


SLR 
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342-344 


SLR 
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36-38 


SQR 
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277-279 


SQR 
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63-65 


TSK 
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99-101 


TYR 
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170-172 


SVR 
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36-38 


SQR 
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277-279 


SQR 
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TVR 
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SLR 
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16 


512-514 
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17 
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SPR 


18 


591-593 


SPR 
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[4] PDOC00006 PS00006 CK2_PH0SPH0jSrTE 
casein kinase H phosphorylation site 



Number of matches: 9 

1 40-43 SPAE 

2 221-224 SDPD 

3 247-250 SVDE 

4 256-259 TUD 

5 277-280 SQRD 

6 388-391 SSRE 

7 402-405 SQEE 

8 540-543 SPGD 

9 635-638 TSQD 

[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 



Number of matches: 


13 


1 


24-29 


GAVYSV 


2 


169-174 


GSVRXF 


3 


181-186 


GSAEAQ 


4 


273-278 


GSVW5Q 


5 


283-288 


GAAGAS 


6 


286-291 


GASPTV 


7 


303-308 


GCVLGS 


8 


307-312 


GSVLGA 


9 


410-415 


GSLPSK 


10 


425-430 


GTFLTD 


11 


554-559 


GSPPSS 


12 


568-573 


GSPPAS 


13 


700-705 


GTRNTL 



Membrane spanning structure and domains: 
Helix Begin End score Certainty 

1 102 122 1.600 Certain 

2 298 318 1.509 Certain 

3 416 436 0.748 Putative 
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BLAST Alignment to Top Hits: 

CRA 1 160000003781983 /altid=gi 18650435 /de£=gb|AAF78208.1|AF237669_l 
(AF237669) RalGDS-iike protein 3 [Mus musculus] /org=Mus 
musculus /taxon=10090 /dataset=nraa /length=709 
Length = 709 

Score = 1119 bits (2862), Expect = 0.0 

Identities = 570/714 (79%), Positives = 623/714 (86%), Gaps = 9/714 (150 

Query: 1 MERXAGKEIJ\LAPLQ[M3EETEIXAVY5V^ 60 

MERTAGKEUU^LQDWGEETEDGAVYSVSLRRQRSQR +P E G Q P P +TFLH 
Sbjct: 1 MERTAGKELA1J\PLQ[^EETEDGAV^^ 59 

Query: 61 YRTSKVRVlJWVPJ_ERLVGELVFGDf^ 120 

YRTSKVR LRAARLERLV ELV GDREQDP F+PAFLAT+R FVPTA +LGFLLPP PPP 
Sbjct: 60 YRTSKVRAU^AARLERLVHELVSGDREQDPGFVPAFLATH^ 119 

Query: 121 PPP — GVEIKKTAVQDLJSFNKNLRAWSVl^^ 178 

PPP GV+ K+T QDL+F4KNU^VVSVLGSWL++HPQDFRD P H +LG+VR FLGW 
Sbjct: 120 PPPPAGVDSIO^GQDIJ^FSKNIJWVSVLGSWL^ 179 

Query: 179 APGSAEAQKAEKLLEDFLEEAEREQ- EEEPPQVWTGPPRVAQTSDPDSSEACAEEEEGLM 237 

APG AEA++AEKLLEDFL+EA+ EQ EEE W+GPPR+AQT + +E C EEE G 
Sbjct: 180 APGGAEAREAEKLLEDFLKEAKGEQTEEEKRLAWSGPPRIAQTPGSEFAeX\^E 238 

Query: 238 PQGPQLLDFSVDEVAEQLTLTOLELFSKVRLYE^ 297 

4GP+LLDFSVQ+-VAEQLTL+D+ELF +VR ECLGS+WSQRDRPGAAG SPTVRATVAQ 
Sbjct: 239 SEGPELU3FSVDDVAEQLTLM3VELF1JIVRSCECL^ 298 

Query: 298 FWVTGCM.GSVLGAPGUW'QRAQRLEK^^ 357 

FNTVTGCVLGSVL APGLAA QRAQR+EKWIRIAQROIELRNFSSIJ^AILSALQSNPIYR 
Sbjct: 299 FMVTGCVLGSVLMrcU\ASQRAQ^ 358 

Query: 358 U<RSWGAVSREPLSTFRKIJ^IFSDENNHLSSRE^ 417 

LKRSWGAVSREPLS FRKLSQIFSDE-+NHLSSR IL QEE TE -f+O GSLPSK P 
Sbjct: 359 LKRSWGAVSREPI^FPJ<l^IFSDEDNHl^SRML^EETTE DDDCPSGSLPSKLP 415 

Query: 418 pgpvpylgtfltdlvmudtalpdmlegdlj^f 477 

PGPVPYLGTFLTDLVMLDTALPD L4G+LINFEKRRKEWEILARIQQLQ+RGQ Y+LSP 
Sbjct: 416 PGPVPYLGTFLTDLVMUDTALPDrTTJCGNL^^ 475 

Query: 478 ppiuv*U4aq^lteeqsyrlsrvo:eppaasc^ 537 
ppiLAAL aq ql+eeqsyr+sr\h:eppaas<^sprirrrisltkrlsakl+rek+ssp 

Sbjct: 476 PPIUXALRAQRQI^EEQSYRVSRVIEPPAASCPSSPraRRRK 535 

Query: 538 SGSFGDPSSPTSSVSFKSPPSSPRSRD-APAGSPPASrcF^PSTKLPl^LBLPSPRPFA 596 

GSPGDPSSPTSSVSPGSPPSSPR+R+ P GSPPASPGPQ PSTKL L-HO PPP 
Sbjct: 536 GGSPGDPSSPTSSN^PGSPPSSPRNREPPPPGSPPASPGPQSPSTKLSLTT^PPGPWPVT 595 

Query: 597 LPLGSPRIPLPAQC^SEARVIRVSIIMJHGNLYRSILLTSQDKAPSVVW 656 

L S R+PL QQfSEARVIRVSI+NM+IGNLYRSILLT QDKAPSW+RAL+KHNVPQP 
Sbjct: 596 LTPSSSRWLLGQQTSEARVIRVSINNINHGNLYRSILLTCQDKAPSVVQ 655 

Query: 657 V^CIMJLFQVLK^N^LIPDNAIWFYAI^PVA^ 710 

WA DYQLFQVLPGDR LLIPD ANVFYAMSP AP DF+LRRKEGT +TLS SP+ 
Sbjct: 656 WARDYQLFQ\/LPGDRELIJ[PDGA^M^AMSPAAPGDF^ 709 (SEQ ID NO: 4) 
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>CRA 1 103000001517213 /a"ltid=gi 17682471 /def=gb|AAF67280.1|AFl86779_l 
(AF186779) RGL protein [Homo sapiens] /org=Homo sapiens 
/taxon=9606 /dataset=nraa /length=768 
Length = 768 

Score = 448 bits (1141) , Expect = e-125 

Identities = 291/744 (3990, Positives = 409/744 (5450, Gaps = 75/744 (10%) 

Query: 11 U^LCP«GEETHX5AVYSVSLRRQRSQRRSPAE GPGGSQAPSPIAI^nTLHYRTSKV 66 

++ +QDWGEE E+GAVY V+L+R + Qf+ GGQP +TYTK+ 

Sbjct: 9 NSSIQC^EEVEEGAVYH\mj<RVQIQQAA^ 66 

Query: 67 RVLRAARLERLVGELVFGDREQDPSFWPAF 126 

R -HA LE+LV L+ + D +++ FL+TYR F T +L LL P E 

Sbjct: 67 RTIKAGTLEKLVENLLTAFG[X\DFTYISIFI_STYRG 126 

Query: 127 iKKTAVQDLSFNKNUttwsN^^ 186 

+ S A+ S+L +WL +DFR+ P L + +L PGS + 

Sbjct: 127 EDGSQSSSESKMVIRNAIASILPA^^ 186 

Query: 187 KAEKLLEDFLEEABlEQEEEPPQVv^GPPRVAQTSDPCSSEACAEEEEGl^PQGPQLIJ^F 246 

+A+ LLE F -HQE E G P S EEEE + + F 

Sbjct: 187 RAQNLLEQF QKQEVETDN GLPNTISFS LEEEEELEGGESAEFTCF 231 

Query: 247 SVDEVAEQLTLIDLELFSKVRLYECLGS\Ai\5QRDRPGAAG 306 

S D VAEQLT +D +LF KV + CLG 4WS+REH- +PT+RAT+-+QFMT+T CV+ 

Sbjct: 232 SEDLVAEQLTYNDAQLFKKWPHHCLGQWSRRC^^ 291 

Query: 307 GSVLGAPGLAAPQRAQRLEKWIRIAQRCREIJWFSSURAILSA^ 366 

++LG L QRA+ +EKWI IA CR L+NFSSLRAI+SALQSN IYRLK-HW AV 
Sbjct: 292 STILGGKEIJGQQRAKIIEKWINIAHECR^ 351 

Query: 367 REPLSTFRKLSQIFSDENNHLSSREILFQEEATEGS QEEDNTPGSLPSKPP — 417 

R+ + F +LS IFSD NNHL+SRE+L +E+++ ++T L + 

Sbjct: 352 RDRMLMFEELSDIFSDHNNHLTSRELLM<EGT5KFAN^ 411 

Query: 418 — FKdPVPYLGTFLTDLVMLTTTALPDMLEGDL^ 475 

G VPYLGTFLTDL MLDTAL D +EG LINFEKRR+E+E++A+I+ LQ C SY ++ 
Sbjct: 412 VMQGTVPYLGTFLTDLTMUJrALQDY^ 471 

Query: 476 PHPPILAALHAQNQLTEEQSYRI^RVIEPPAASCPSSPRIRRRISL^ LAREKS 534 

P + Q LTEE+SY LS IE A + +SP+ R+ S+ KRLS L + 
Sbjct: 472 PDQKFIQWFQRQQLLTEEESYALSCEIEAAADASTTSPKPRK — SMVKRLSLLFLGSDMI 529 

Query: 535 SSPSGSPGDPSSPTSSVSPGSPPS SPRSRDAPAGSPPASPGPQGPSTKLP 584 

+SP++ PSSSSS +AGS PPKL 

Sbjct: 530 TSPTPTKEQPKSTASGSSGESNDSVSVSSCES^ 589 

Query: 585 LSLDLPSPRPFAL — PLGSPRI PLPAQQSSEA 614 

S40 S +L PL SP P+ QQ+ + 

Sbjct: 590 SCSSIHSNtJT^SGNSSLINPI^SPPSC^FI^^ 649 

Query: 615 RVIRVSIDhO<^LYRSILLT5QDKAPSWRRALQKHN\^ 674 

+IR+ S I I I I I GN+Y+SI+LTSQDK P+V++RA+ KHIsH- A +Y+L QV+ Df L+ 
Sbjct: 650 QIRISVEDNNGNWKSIMLTSCP^ 709 
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Query: 675 IPDNANVFYflMSFVAPRDFMLRRK 698 

IPD+ANVFYAM+ DR-LR+K 
Sbjct: 710 IPDSANVFYAMNSQVNFDFILRKK 733 (SEQ ID NO: 5) 



Miner search results (Pfam): 

Model Description Score E-value N 

PF00617 RasGEF domain 226.4 4.3e-64 1 
PF00788 Ras association (RalGDS/AF-6) domain 60.6 3.8e-15 1 

PF00618 Guanine nucleotide exchange factor for Ras-1 12.0 0.079 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hrcm-t score E-value 

PF00618 1/1 63 113 .. 1 60 [. 12.0 0.079 

PF00617 1/1 244 454 .. 1 227 [] 226.4 4.3e-64 

PF00788 1/1 613 700 .. 1 112 [] 60.6 3.8e-15 
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1 GTCCCCGCCC CACCTCCCPG 
51 GACGTCAGAG ACCCTUTCCC 
101 TCCCACCCTG TCCCOGAAAA 
151 GCCCGAGACA GAAGACCTAT 
201 GAGGCCATCC CACAATCGCC 
251 CTCCCCGAAA CCTTTACATA 
301 CAGACGTGGC CTTGACCTAA 
351 TQCAGGTQQG CCCTTACCTC 
401 CCCCTGGGTG GAGCCTTCCA 
451 TCTACCCTGG CGGCAGGAGG 
501 CCCAGGCAGA TAACTATGTC 
551 CTGCTGAAAC TGCAGGCGCA 
601 GTT3CCACATC GCTAACCGCG 
651 AGAGATCTAG GAGTAGGGCT 
701 TTCGGGAGAC CAGATTCCAG 
751 AACGGGCGAG GTTTGACGAG 
801 AACCAGAAGA GGTGCAGGCG 
851 GTATCCGGAT GGGGAGAGGC 
901 GACCTCAGTC ATGAGGGGGT 
951 AGGGATCGGA GTGGGCAGTG 
1001 AATGGGGTAG GGAGGAGGGG 
1051 GCCCACCTTC CTCTCTCGCT 
1101 AGAGGGAAGG CTGCCCGAAT 
1151 CTTCCAAGGA CAAG1 I I I I I 
1201 CGGTTGGGGC CCCAAGCTCG 
1251 GGGCCAAGGT GGGGGTGGAG 
1301 TGGGGACCrc GATCCCACAT 
1351 GAC3GAAGAGA GTGAGGAGGA 
1401 CAAGATCCGT TCCCAGAAAT 
1451 CTCGGAGGTC CTAGACTCGT 
1501 CCCCCGAATC TCCGGGTCTA 
1551 CGCGGCCGGA CCGGGAACGG 
1601 GGGGCCGGGC CAGACGTTCC 
1651 CGGCCTCAGA GTTGCGCAAT 
1701 ATLIGI 1 1 1 1 GACCCTCCAC 
1751 AGGGCCACAG ACTGCCTCCT 
1801 TAGTGACGCA GATGGAGCTT 
1851 TTATGCCTGT AATCCCAGTA 
1901 GAGTCCAGGA GCTCCAGGCT 
1951 CAGCCTGGGC CACAGAGCCA 
2001 GGGTGCAGTC GCTCATTCAT 
2051 GCTGGGTGTC TCGCTTGAGC 
2101 GGTGAAAACC AGTTTTTACC 
2151 TGGTGGCATA TGCCTGTAGT 
2201 ATGGATTGAA CCCAGGAGGC 
2251 TGCACTGCAG CCTTGGCAAC 
2301 AAGGCTGGGC GCCGTGGTTC 
2351 GAGGTGGGCG AATCTCTTGA 
2401 ATGGCAAAAC CCCGCCTCTA 
2451 GGCGGCACCT GTAATCCTAG 
2501 TTGAATCTGG GAGGCGGAGG 
2551 TCCAGCATCG GTGACAGAAC 
2601 AATAAAAAGT GGAAGATGCC 
2651 TGGAGGACAG AGGACTCTGG 
2701 AGCTCCCCCA CATCTCGGGA 
2751 TCCTTTCTTC TGGGGACAGA 
2801 CTCTCCTGGG CTGGCGCTGG 
2851 CGGCCTGCCC TGCCCCGCCC 
2901 TGGCCTCAGC GCCCGGCCAC 



GGGAAGCTCT CACTCCCCAA GGAAGCCCAA 
GTCTCAAGCT CCGCCTCCAA GGGAAAGAAG 
GCTCGGAGAC CTATCTGCAG TTCTCGAGCA 
CCTGCCCGCA GCAGAGGGCC ATTTCTTCAG 
TCCCTCGAAT CCTTACTGTT CCCAAGAGGC 
TAGAGTTAGA GTTCAGAGGC TCCCCCATCA 
GTCCCCGCCC CCAATGGCTA GGATCGACCC 
CTAGGCCCCG CCCCCGAAAG AAAAGCCCCG 
AGAGGCCCCA CCCCTCTCGG CTGACCCGGC 
ACGGCCGCTT CGCGGGAAAG GAGCTGGATC 
CCAAACCTCC TGGGCCTCGT GGAGGAAAAG 
GCTCCAGGGC CACGACGTGC AGGAGATGCT 
AGGTGCCCTG CAGCCATGGG GCAGAGCTCC 
GGGTCGGAAA ACCAGGGAAC CGGGGGCAAC 
GCTGATCAGC CAGGAAGGGG TCTCGGGGAC 
CCAGGGGTGA GCCCGAGCCA CGAGGGAAGA 
GGTCCTGTGA GGCTAGAGGA CGCGGCTGGG 
CTAGTGGCAG GGCCCCAGAG ACCAGGGGCG 
GAAGCTGGGT ATCAAGGTGC CTGCTTAAGG 
GTCTCCAGTT GGAACTAGGG CATATCTCGG 
GAACGTGACT TCTCGGTGGC CCCTGGCCCC 
TCCCATCCCC GCTTAGTTCC TCGCCAGCTT 
ACAACACCCG CATCGCCCTG CCCCTTGCCA 
GGTCAGTATG GGTGGGGGGA TGAAGGGGTG 
GTAGCAGGAG GTTGTCAGAT ACCCTCTTCA 
TGGGGTGGTA AAGGGAAGGG GAGCCAGGGA 
CGCTCCCreC TCATCCCCCC ACCCCCCTTA 
GGACAACGAG GTAGTGACCC GCGCATCACT 
TMTCGAAAG TCACAAGAAG CACCGTCGCT 
CCTGACACCC ACCAGGCGGC CCCTTCGGAG 
GCGCACGCCA CGGGCGCTTC AGGGGCTCAA 
AGGCGGCCAG CGGCGCCCGG AGGGGAGGAA 
CACAGTAAAT CTCCCCAGCT GGGTCCGCCC 
AAATGTTACC GACCATGCCC CTGGGTATTC 
CACCCAAGAG ACGGCTGTCC CTGAAAACCC 
CCAACCTGGG TCATGATGAC TCCCATCAGC 
AAAAATGGGA GATGGCCCGA TGTAGTGGTT 
TTTTGGGAGG CTGAGTTGGG AGGATCACTT 
GCAGTGAGCT ATGATCGTGC TACTGCACTC 
GACCCTGTCT CAATAAATAA AATAAGGGCG 
ACCTATATTC CCAGCACTTT GGGAGGCTGA 
CCAGGGGTTC CAGACTAGCC TGGGCAACAT 
AAAAAAAAAA AAAAAAAAAA AAGCTGAGCA 
CCCAGCTACT TGGGAGACTG AGGCAGGAGA 
GGAGATTGCA GTGAGCCAAG ATCAAGCCAC 
AGGAGTGAGA CCCTGTCTCT AAAAAATAAT 
ATGCCTGTAA TCCCAGCATT TTGGGAGGCT 
GGCCAGGAGT TTGAGACCAG CCTGGCAAGT 
CAAAAAATAC AAAAATTAGC TGAGCATCGT 
CTACTTGGGA GGCTGAGGCA CAAGAATCCT 
TTGCAGTGAG TTAAGATCAA GCCACTGTAC 
GAGACTCCAT CTCAAAATAA TAGCAATAAT 
CCCACACTTG ATCAAGCTAG CCCCTTCCAC 
TCTGGGGACA CACACATGCC CCCACACAGG 
TACAAAAAAG ACCCCTTGGG GACAGATATG 
TTGATAGGCA CCCAGCGGAA GAGCCAGGAC 
GTCCGGCTCG AGGCACCCAG AGGCTGGGTC 
CGCCCCAGCA GCTCGGCCGC TCCGCCCCTC 
TGCCCGCCGC CCGCCACCCG CCACCCGCCG 



FIGURE 3A 



2951 GCCCTTCCGC CTCACTCAGC GGCGCCACTG AGAGGGACGG GCGCCGGCCA 
3001 TGGAGCGCAC AGCAGGCAAA GAGCTGGCCC TGGTAAGGGG ACAAQQGATC 
3051 CCOGGACCCC GCATCCCTGG TGACCCGCAG GTCCAGAAAC TCCAAGOGCC 
3101 CGCCCGTCGG ACGGTATCTG CTCCCAATCT GAACTTCCCC TGGAGTCCCC 
3151 TCCTGGGGAC TCGGGGCCCT TGACCCAGTG AAGCGACTGG TTCCTCTTAG 
3201 GGATGGGGGC GCGAGTCTCT GAGCGCAGTC GGCAGAAAGA GCTAGAGACA 
3251 GGTTCTATTA GACTGGGCCC TGGGACATCC CCAAATCCCA CCCCATGTCC 
3301 TCAGGACCTC GGAGGAGGGG ACCCGCAGCG AGGAGGGGAC TAGCCTGGGA 
3351 CCCCAGCCCT AGTCTCGCAG CTTCTGGCCG GGAAGGGGCG TGGGGATGCA 
3401 GCAGGAGGAC TCGGCCGGAG TCCGAGCGGC CAAGGAGGCT GAGGCCCCAG 
3451 GACCTGTCCC CCTTTGGTGC CCTGAGTCCG CCTGTGCGTC CAGGCACCGC 
3501 TGCAGGACTG GGGTGAAGAG ACCGAGGACG GCGGGGTGTA CAGTGTCTCC 
3551 CTGCGGCGGC AGCGCAGTCA GCGCAGGAGC CCGGCGGAGG GCCCCGGGGG 
3601 CAGCCAGGTG AGGAGGGGGT TTGGTGGGTG GCGCGGGGCC GGAAGCGACC 
3651 AGTTGAGGGC GGAGCTGGAG AGCGGAGCAC AGGCCGCCAG GTGCAGTGGG 
3701 CGGAAGGAAG GGAGGGGCTC GGAGGCGACC AGATGAGGCG ACCAGGTAGA 
3751 AAGGGGACTG GGGGCGGCCA GGTAAGTGGG GGGAGATCCA GGGAATGGGG 
3801 TGGGGCCAGG CGATGGCGGC GCAGTTCCCG AGAGGAGCCT AGGGACAACT 
3851 TGGTAAGGAC AGAACTGGAC GGCAGAGTTG GGAAAGGCAG GTTTAGAGGG 
3901 CCGGGGCTGG AAGGTGGAAT GGGGTTGGTT TAGCAAGTGG CTAGGTGAGG 
3951 GCGGATGGGG CAGCCAGTGA AGCGCGACAG GAGGGCTGAG GGAAGCCCTG 
4001 GGTGGAAAAG AGTGTGTGGG GCGGGGGGGG GGGGGTGGGG GGAGGGGACG 
4051 GGAGGGGGAG GGGACGGGAG AGGGAGTAGG GGACAGGGCA TGGGAGAGGG 
4101 AGGGTTTCCA GGGCAAGTTG CAGGAGCTAT TTGTGGATGG GGAGGAACAA 
4151 TAACTTCAAG CGGGCAGGGA GTGGGGCACA CACCTATAAT CCCTCCGCTT 
4201 CGAGAGACCA AGGCAGAAGG CCAGGAATTG GAGACCAGCC TGGACAACAC 
4251 AGCAAGATTC TCTCTAATAA AAATAAAAAT TAAAAAACTA GCTGTGGGTG 
4301 ATGATGCCCA GCAGTGGTCC CAGCTACTCA GGAGGCTGGG GCAGAGGGAC 
4351 CGCTTGAGTC TAGGACTTGG AGGCTGCAGT GAGCTATGAT TGTGCCACTG 
4401 CACCCCAGCC TGGGCAACAA AACAAGACCT GTTTCTAAAA AAAACAAACC 
4451 AAAACAATAA CTCCAAGAAG CCGGGAGACA GAGGAATCAC ATGAAAGAAT 
4501 GGTGCTACAG GCGGGGCGAG GTGGCTCACG CCTGTMTCC CAGCACTTTG 
4551 GGAGGCGGAG GCAAGTGGAT CATCAGGTCA GGAGTTCAAG ACCAACCTGG 
4601 CCAAGACGGT GAAACACCGT CTCTACTAAA MTACGAAAA AACTAGCTCG 
4651 GCTTGGTGGC GGGTGCCTGT AATCCCAGAT ACTTGGGAGG CTGAGGCAGA 
4701 GAATTGCTTG AACCCAGGAG GGGGAGGTTG CAGTGAGCCA AGATCACGCC 
4751 ACTGAACTCC AGCCTAGGTG ACAGAGTGAG ATTCTCTCTC AAAAAAAAAA 
4801 AAAAAAAAGT GGTGCTAGGG GCTGGGCACG GTGGTTCACG CCTGTAATCC 
4851 TAGCCCTTTG GGAGACTTTG GGAGGCCAAG GGGGGCAGAT TACTTGAGGT 
4901 CAGGAGTTCG AGACCAGTCT GACCAACATG GTGAAACCCT ATCTCTACAA 
4951 AAATACAAAA ATTAGCTGGG CTTGGTGGTG TGCGCCTGTA GTTTCAGCTA 
5001 CTTGGAGGCT GAGGAAGGAG GATTGCTTGA ACCCAGGAGG CAGAAGTTGA 
5051 AGTGACCCAA GATGGTGCCA CTGCACTCCA GCCTGGGCAA CAGAGTGAGA 
5101 CTCTGTCTCA AAAAAAAAAA CAAAAAAAAA AAGAGTGGTG CTAGTGATGA 
5151 ATGTGACTAG AGAAGGGGTG CTGTGAGGAC CACTCCTCCT CTCTCATGGC 
5201 CACCTCTCCC CTCCTGCAGG CTCCCAGCCC CATTGCCAAT ACCTTCCTCC 
5251 ACTATCGAAC CAGCAAGGTG AGGGTGCTGA GGGCAGCGCG CCTGGAGCGG 
5301 CTGGTGGGAG AGTTGGTGTT TCGAGACCGT GAGGAGGACC CCAGCTTCAT 
5351 GCCCGCCTTC CTGGCCACCT ACCGGACCTT TGTACCCACT GCCTGCCTGC 
5401 TGGGCTTTCT GCTGCCACCA ATGCCACCGC CCCCACCTCC CGGGTCAGTA 
5451 GCGAACCATA ACCTCCGTAT TCTCCACCCT AGAACCCCAA CTGGGCACCC 
5501 CCCTCCACCT CCTCAGGTGT GGAACCTGGA AACACCTCCC AGACCCAGAG 
5551 CCCTCTTCCT AAGCCCCCTC TAGGTTCCCC CTTCTTCACC TGCTGGGGGG 
5601 CCTCTTCCCA GGGTAGAGAT CAAGAAGACA GCGGTACAAG ATCTGAGCTT 
5651 CAACAAGAAC CTCAGGTGGG TCOTCATCC AGATAGGGGA GTGCGGGGAG 
5701 GGAAATCCAA GAGGTCAAAG GTTAGCAGTC GGACTGGGGT TTTGAAAATT 
5751 GCAGGTTGGG TAATAAGAGA CTGGGAGTCA GGTGGGGCGT GGTGGCTCAT 
5801 GCCTGTAATC CCAACACTTT GGGAGGCGGA GGCAGGTGGA TCACTGGAGG 
5851 TCAGGAGTTA GAGACCATCC TCGCCAATAT GGGGAAACCC TGTCTCTACT 
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5901 AAAAATACAA CAACAACAAA AAAAGGTAGC TGGGTCTGGT GGCGCATCCC 
5951 TCTACTCCCA GCTACTCGGG AGGCTGAGGT TGCAGTGAGT CAAGATCAGG 
6001 CCATTGCACT GCAGCCTTGG TGACACAGTA AGACTCTATC TCAAAAAAAA 
6051 AAAAAAAAAA MGGTACCAG GAGTCATATT CTATGTCCCC CACTCTGGAC 
6101 CCAGCTCTGA GACCCTGCCT CTCTGGCCAG GGCTGTGGTG TCAGTGCTGG 
6151 GCTCCTGGCT GCAGGACCAC CCTCAGGATT TCCGAGACCA CCCTCTCCAT 
6201 TCGGACCTGG GCAGTGTCCG AACCTTTCTG GGCTCGGCGG CCCCAGGGAG 
6251 TGCTGAGGCT CAAAAAGCAG AGAAGCTTCT GGAAGATTTT TTGGAGGAGG 
6301 CTCAGGGAGA GCAGGAAGAG GAGCGGCCTC AGGTGTGGAC AGGTGAGGGG 
6351 TTTTCAGATC CAGTCGTGTT CTGAGAAGGC CTTTCCTGTC TGCTTCTTCC 
6401 CACACAGGCT TTCTCTCCCC TCTCAGAGCT ACAAAACTTA AGCAAGATTT 
6451 TAAACTCTAA GCCTCAATTT CTTCATCTTT ACAATGGGGA TAATAATTCT 
6501 TTGTCAGCGG GGCGTGGTGG CTCACGCCTG TATCCCAGCA GTTTGGGAGG 
6551 CCAAGGATGG TGTATCACCT GAGGTCAGGA GTTTGAGACC AGTCTGACAA 
6601 ACATGGAGAA ACCCCATCCC TACTAAAAAT ACAAAATTAG CCGGGCGTGG 
6651 TGGGGCATGT CTATAATCCC AGCTATTCGG GAGGCTGAGG CAGGAGAATC 
6701 GTTTGAACCC GGGAGGGGGA GGTTGGGGTG AGTCGAGATC GTGCCATTGC 
6751 ACTCTCGCCT GGACAACCAG AGCGAAACTC CGTCTAAAAA AAAAAACAAA 
6801 TTLI I IGICT GAAGTATTAG CATGTGTCTA ATACTTTTCC CTCCTTGGTG 
6851 CCGTTGGGTC AGGATGCTCT GTGTTTCTAG CTACAAACCA TTGCCTTGAT 
6901 ACTTGTCTTT Al I I ILI I I I TTTTGAGTCA GGGTCTTGGT CCGTTCCTCA 
6951 GGCTGGAGTC CAGIGTCTCC ATCATGGCTC AGTGCAGGCT CAACATCCTG 
7001 GACTCAGGTC ATCCTCCGGC CTGGGTCTCC AAAACTGCTG GCATTACAGG 
7051 CGCGA GCCAC TATACCTAGC CTGTAAAATT I I I LI IATTT TTCAATTTCT 
7101 TTTTAAATTT AATTTAATTT AATTTTATTT TTTTATCTAT I I I I I I 1 1 1 I 
7151 AGACAGAGTC TCGCACTGTT ACCCAGGCTG GAGTGCAGTG GCACAATCTT 
7201 GGCTCACTGC AACCTCCACC TCC7GGGCTC AAGCCATTCT CCTGTCTCAG 
7251 CCTCC TGAGT AGCTCGGACC ACAGGGGCAT GTCACCACGC CCGGCTMTT 
7301 1 1 1 1 IGTAAA GGTGAGGTTG TGCCATGTTG CCCAGGCTGG TCTCAAACTC 
7351 CTGAACTCAA GTGATCTCCC TGCCTTCGCC TCCCAAAATG CTGGGATTAC 
7401 AGCCATAAGC CATTGTGCAT GCGTAGCCTC CTTACTTGAT TATTGGCTTT 
7451 TGCTCATCTC ATAGGCTGTG AGTCCATGAG AGGAGGACCT G l IGI I LI I G 
7501 CTCCCAGCTC TGTCCCCAGG GGCAGGAACA ACACAGATTA GTTTGCTGAA 
7551 TAATTGCATC CTGCTTAGGA AGTATCATCT TTCACCCATC TGTATTTGAT 
7601 CTGATCCACA TCACAAAAGC ATCTCTATCC CTAATCCCCA TGGCTTAATC 
7651 TCCAGATTAT AGAGGCCACC TTCCTGTCCA ATTTACAAAG TAGCAGCCAC 
7701 TTCTCTATCC CTGGTGACAA AGTCTCAGTT ATTTATATAT ATATAAAGGT 
7751 ATATATATAT ATATATATAT ATATATATAT ATATACATGA AGGTGTATAT 
7801 ATATATATAT ATATGAAGGT ATATATATAT ATGTATATAT ATGAAGGTAT 
7851 ATATATATAA AGGTATATAT ATATAAAGGT ATATATCTAA AGGTATATAT 
7901 ATATATAAAG GTATATATAA GGGTATATAT ATATAAAGGT ATATATATAT 
7951 ATATGAAGGT ATATATATAT ATGTATATAT ATGAAGGTAT ATATATATAA 
8001 AGGTATATAT ATATAAAGGT ATATATATAA AGGTATATAT ATATATAAAG 
8051 GTATATATAT ATAAAGGTAT ATATATATAT AAAGGTATAT ATATATAGGT 
8101 GTATATATAT ATATATATAT ATATATATAT ATATATATAT ATATGATTTC 
8151 TCCAGCTGAT TCCAAGTCAT TAGAGCTCGA TAGTTCACTG TGGTATCCAC 
8201 TAGCACCTGT CGCTATTTAA ATTAATTAAA ATTGGCTGGG CGCGGCGGCT 
8251 CATGCCTATA ATCCCAGCAC TTTGGGAGGC CGACGGGGGC GGATCCCAAG 
8301 GTTCGGAGAT CGAGACCATC CTGGCTAACA TGGTGAAACC CCGTCTCTAC 
8351 TAAAAATACA AAAAAATATT AGCCGGGGGT GGTGGCGAGC GCCTGTAGTC 
8401 CCAGCTACTC GGGAGGCTGA GGCAGGAGAA TGGGGCGAAC CTGGGAGGCG 
8451 GAGCTTGCAG TGAGCTGAGA TCAAGCCACT GAACTCCAAC CTGGGTGACA 
8501 CAGCGAGACT CTGCCTAAAA AAAAAAAACC AAAAAACAAA ATTATAATAA 
8551 TAATTAATTA ATTAATTAAA ATTAAATAAA ATTCAGGTCT 1 1 IL I I 1 1 I A 
8601 GAGATGGGGT CTTGCCATGT TGCCCAGGCT GGTCTCGAAC TCCTGGGCTT 
8651 AAGCAATCCT CCAGCATCAA CCTCTCAGAG TGLTGGGATT GTAAGTGTGA 
8701 GCTACTGTGC CTGACCCTGC LI 1 1 I I 1 1 I I 1 1 1 1 1 1 I 1 1 1 TTTTGAGACG 
8751 GAGCCTCGCT CTGTCACCCA GGCTGGAGTG CAGTGGCGCC ATCTCGGCAC 
8801 ACTGCAACCT CCTCCTCCCA GGTTCAAAAG ATTCTCCTGC CTTAGCCTCC 
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8851 CAAGTAGCTG GGATTATAGG CACCTGCCAC CACACCCAGC TAAI I I PGTA 
8901 I I I I I ATTAG AGACAGCGTT TCACTATGTT GGTCAGTCTG GTCTTGAACT 
8951 CCTGACCTCA GGTGATCCAC CCACCTCGGT CTCCCAAAGT GCTGGGATTA 

9001 CAGGCGTGAG CTACCATGCC TGGCCCGCTT 1 1 1 1 1 I I I 1 1 1 1 I I I I I < I I 

9051 TTTCAAAATC CAGTCAAGCA AAGGCAAAAA TTCAGGTCTT CAATCCCACT 
9101 ACCCAC ATTT TGAGTGCTCA GCCACCACAC TGGACATAGC AGATAGATAA 
9151 II I I ICCACC ATTGCAGAGA ATTATATGGA AAGTGCTGCC CTAGTTTCTT 
9201 TCAGGTCAGA GGAGAAAATT AACATTTGTT TAAGACCTTC TATGTGCTAG 
9251 GCCCTGGGAC ACACTTTATT TCATTTTATT TTATTTTATT TAI 1 1 1 I ACT 
9301 TTTATTTTAT TTPGAGACAG AGTCTGGCTC TGTCGCCTAG GCTGGAGTGC 
9351 AATGGGGCGA TCTTGGCTTA CTGCAACCTC CACCTCCTGG GTTCAAGTGA 
9401 TTCTCCTGCC T CAGCCT CCT GAGTAGCTGG TACTACAGGC GCCCGCCACC 
9451 AGGCCCAGCT AATTTTTTGT AGTTTTAGTA GAGACGGGGT TTCACCGTGC 
9501 TAGCCAAGAT GGTCTCGATC TCCTCATCTC GTCATCCGCT TGCCTCGGTC 
9551 TCC CAAAGTC CTGGGATTAC AGGCGTGAGC CACCGCACCC GACTATGAAT 
9601 TTTAI I 1 1 IA GATACAGGGT CTTGCTCTGT TGCCCAGGCT GGACTCGAAC 
9651 TCCTGGGCrC AAGTCAGCCT CCTACCTCAG CCTCCTGAGT AGCTAAGACT 
9701 ACACTTGCAC CATGTAGTTT AGAAGAAAGT AGATGACCAC CATGCTCATC 
9751 TATTTTATTT TAACAACTTT ATTTTGGGTT CACTTTTTGC TATGGAAAAT 
9801 TTCAGACATA TACAAAAGTA GAGAGAATAG TATGAAGAAC ATTCAGACAT 
9851 CCATCACCTA TCATCAACGA TGATCAATTT CACAAAAAAA TATTTTCAGG 
9901 ATCATTTTAA AACAAATCCC GGGCTTATGT CAATTCATAC ATAA A I G I I I 
9951 TQGGTACACA TGTCTGACAA CAGGCTTACT I I I I I I I I I I 1 1 I I I I I IGA 
10001 GACGGAGTTT CGCTCTTGTT GCCCAGGCTG GAGTGCAATG GCAGGATCTC 
10051 AGCTCACCTC AACCTCTGCC TCCTGGGTTC AAGTGATTCT CCTGCCTCAG 
10101 CCTC CCGAGT AGCTCGGATT ACAGGCGTGC ACCACCACAC CCGGCTAATT 
10151 TTCTATTTTT AGTAGAGAGG GGGGTTTCTC CAIGI I IGTC AGGCTGGTCT 
10201 CGAACTCCTG ACCTCAGGTG ATCGGCCCAC CTTGGCCTCC CAAAGTGTTG 
10251 GTATTACAGG CGTGAGCCAT GGCGCCCGGC CCTTTTATTT TT A I I I 1 1 I A 
10301 ATAACCTTCA TGTTCATACT TAAAAAAAAA TCAGAAATAT TTGATATAAA 
10351 AAAAATCCAA TCCAGGCCAG ATGCAGAGGC TCCTGCTGGC GATCCCAGCA 
10401 TTTTGGGAGG CCAAGGCAGG TGGATGGGCT TTGAGCCCAG GAGATTGAGA 
10451 CTAGCCTGGG CAACATGTPG AAACTTTGTG TCTACAAATA ATTAGCTGGG 
10501 CATGG TGGTG ACTCCCTATA GTCCCAGCTG CTTGGGAGGC TGAGGCAAGA 
10551 GGATCATTTT AGCCTGGGAT GGTCAAGGCT GCAGTGAGCC GTGATTATGC 
10601 CACTGTACTC CAGCCTAGGT GACAGAGCGA GACCCTGCCT CAAAAACAGA 
10651 AA AAATACCC AGTCTATATT CAAATATTCA AATCCCCTGT TTGTGCCTGA 
10701 ACL I I I I I 1 1 GGACACTGGG TTTTCCTATT TTGCCTCGGC TGGGCTTGAA 
10751 CTCCTGACCC TCCCACCTCA GCCTCC7GAG TAGCTGGGAC CACAGGTGCC 
10801 CACCATGGCA CCCAGCCCTA AATTTTCTTT TGACAGTTGT TTCTGGCCAG 
10851 G7GTTGTGGC ACATGCCTAT AGTCCCAGCT ACTTAGGATG CTGAGATCGG 
10901 AGGATCTCTT GACTCCGGGA AATCAAAAGC TGCCGTGAGC TGTGAGCATG 
10951 CCCCTGCACT CCAGGCGATA GAGCTGGGGG AAGGAGGAAT AGTTGTTTCT 
11001 TCAAATTGAA ATCCAAAGAT CTACTCAAGG TATTTGGTTG TTTGCTTCTC 
11051 1 1 I I 1 1 I I 1 1 I I I I I I I I I I TTTGAGATGG AGTCTCACTC TGTTGTCCAG 
11101 GCTGGAGGGT AGTGGCGTGA TCTTGGCTCA CTGCAACCTC CGCCTCCTGG 
11151 GTTCAAGCGA TTCTCCTGGC TCAGCCTCCT GAGTAGCTGA GTTTACAGGT 
11201 GCCCACCAAC ACGCCCAGCT AATTTTTGTA I I I I IAGTAG TGAGGGGGTT 
11251 TCACCATGTT GGCCAGGCTG GTCTTGAACT CCTAACCTTT AGTGATCTGC 
11301 CCACATCGGC CTCCCGAAGT GTCGGGATTA CAGACATGAG TCACCACGCC 
11351 CTACCGGTCG TTTGTTCATA AGTCTLl 1 1 I ATTCTGTAAC AGATCCCCCT 
11401 TGCCTCTTGT TTGAAGCCAT TAGAGGGCAA AAAAAATGGG TC A I I I I I C C 
11451 TGAGGTATGT CTCACATTCT TTTCGACTTA CCTCATGGTT TCATGCAGCA 
11501 TGTTTCTCTA TCCCC ATAAT TGCTGTAAGA TTTAAAGGTT TGATTAGATG 
11551 TAGGGCATTT I I I 1 1 ICCAG GGCCCACTTT I I I I IGGGGT GGGGGGAGGA 
11601 GAGACAGTTT CTTGCTCTGT CACCCAGGCT GGAGTGCTAT GGCATGATCA 
11651 CAGCTCACTG CAGCCTTGAC CTCCTGGGCT CAAGAGATCC TCCCTCCTAA 
11701 GCCTCTTGAG TAGGTGGGAC AGCAGGTGTG CATCAGGATG CGCAACTTTA 
11751 AAAAI I I 1 1 1 TATGTAGACA TGGGGTCTCA CTACGCCGCC CAGGCTGGTC 
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11801 TCAAAcrccr ggtctcaagc AATCCTCCTA CCTCAACCTC CAAAAGTGCT 
11851 GGGACTATAG GTGTGCCCAG CCCAGTACCC ACTTCTAAAA ACTAATATTT 
11901 TCCAATGCCA CCTGTCCTAA TTCAAGATCA AAGAGGTAAT TACACAGATT 
11951 TACAAAGATT ATTTTAAAAT AATAGTATTG GGGCAGGGTG CTATGGCTCA 
12001 TQCCTGTMT CCCAGCACGT TGGGAAGCCG AGGCAGGAGG ATCACCTCAG 
12051 GTCAGGAGTT GGAGACCAGT CCGGCCAACA TGGTGAAACC CCATCTCTAC 
12101 TAAAAAAAAT AAAAAATAAA ATAAAATAAA ATAAAAAATA AATAAATMT 
12151 AAAAAAATAT ATATATATTT AAATTAGCTC GCTGGGCATA GTGGCACCTC 
12201 CTGTAGTCCC AGTTGCTCAG GAGGCTGAGG CAGGAGAATT GCTTGAACCC 
12251 TGGAGGCAGA GGTTGCAGTG AGCCGAGATC GAGCCACTGC ACTCCAGCCT 
12301 GGGCGACAGA GCAAGACTCC ATCACAAAAT AAAAAAATAA AATAAAATAA 
12351 TAGTATGATG CCATAACTAG TACAAAGGAG AAGGAAAGTG AGAGTAACTT 
12401 ACACAGCAAT AAACCATGTT TTCAATGGGT AATCCTTCGG TATGCCCCAC 
12451 TAGGACACAT GATGAGGTTG TCCCGTGTCT TTGCCTGTCC TAGCGTCACA 
12501 GTAGAGTGTC ACGGTGCTGT TGTACTGACA GCAACAAGCA CCAACGAACG 
12551 CA CAGGAG GG CACTGGTGAG GCAAAGACAG CAACATAGGT TCTGGGGACA 
12601 TCATTTTCCA AACTTGTGAA CAACATTTGC AATTTGCAAA CAAAACAAAG 
12651 CCCAGACTTT CGTGGTCCTT GCATTCTTGG AGGCAAAAAA ATTTGTGTTT 
12701 ATGAACAAAA TAGTCAGGTT CTAGGTGCAT ATTATTGCAA ACATGTTTTT 
12751 CTTTTCTTTT TGI 1 1 I IGI I TGTTTGTTTG TTTTGTTTTG I I I I G I 1 1 I I 
12801 TGAGATGGAG TCTGGCTCTG TCGCCCAGGC TGGAGTGCAG TGGCATGATC 
12851 TCGGCTTACT GCAAGCTCCG CCTCGCCGGT TCACGCCATT CTCCTGCCTC 
12901 AGCCTCC TGG GTAGCTGGGA CTACAGGCGC CCGCCACCAC GCCTGGCTAA 
12951 1 1 I I I ICTAT I I I I IAGTAG AGACGGGGTT TCACCATGTT AGCCAGGATG 
13001 GTCTCGATCT CCTGACCTCG TGATCTACCC GCCTTGGCCT CCCAAAGTGC 
13051 TGGGATTACA GGCGTGAGCC ACTGCCCCCG GCCTTCTTTT CTTTTCTTT1 
13101 1 1 I I 1 1 1 1 1 I TTGAGACAAA GTCTCTGTCA CCCAGGCTAG AGTGCGGTGG 
13151 CGTGGACCTG GCTCACTGCA ACCTCCACCT TCTAGGTTGA GGTGATTCTC 
13201 TAG CCTTAGC CTCCCGAGCT GGGATTACAG GCACTTGCCA CCATGCTCAG 
13251 CTGAI I I I IG TAI 1 1 I IAGT AGAGACAGGG TTTCGCCATG TTGGCCGGAC 
13301 TGGTCTCGAA CTCTTGACCT CAAGTGATTC GCCTGCCTTG GCCTCCCAAA 
13351 GTGCTGGGAT TACATGTGTG AGCCACTGTG CCAGACCCCT TCTTCCTTTC 
13401 TTAAAGACAA GTCAAGTGCA GTAGTGAGAA GGGGGGAAAG AGTAGAACAA 
13451 GGAGTTCGAT CTGTAACTGT GAACAATCAA TTGAGATAAG TCACTACCTT 
13501 GGGACCAGCC ACAAACAGGT TTTTCAAAGA CACAAATGTC TGGAGATACA 
13551 TTTGGAGGCT AGAGGGCACA ATTCAGGATC CCAGTTTCCA AAGTTTCCCC 
13601 TCCAGGGTGC CACCATCAAA ATCCACTAAA GTAAAATTAT TCATATTTGT 
13651 TCAGCACTTT ATAGCAGTCT GGTAGCATGA TCI I I I 1 1 1 I I I I I 1 1 I G AG 
13701 ATGGAGTCTC GCTCTGTCGC CAGGCTGGAG TGCAGTGACA GGATGTGGGC 
13751 TCACTGCAAG CTCGGTCTCC AGGGTTCAAG CGATTCTCCT GCCTCAGCCC 
13801 CC CGAGTAGC TGGGATTACA GGCGCGTGCC ATCACGCCCG GCTA AI I I I I 
13851 GTAI I 1 1 1 1 1 TTAGTAGAGA CGGGGTTTCA CCGTGTTGGC CACGCTGGTC 
13901 TCGAACTCCT GACCTCAGGT GATCCACCGG CCTCGGCCTC CCAAAGTGCT 
13951 GGGATTACAG GCGTGAGCCA CAGCGCCCGG CAGCATGATC TTAAACGAAA 
14001 ACAAAAACGA AATCCACAGC CAGGCGCACT GGCTCACACC CGCAATCCCA 
14051 AAACTTTGGG AAGCCAAGAG GGAGGATCGC TTGAGCCCAG GTGTTTGAGA 
14101 CCAGCCTGGC AACATAATGA GACCCTGTCT CTACAAAAAA TAAAAAATTA 
14151 GCTGGGCATG GTGGTGTGTG CCTATAGTCC CAGCAACTCA GGAGGCTGAG 
14201 GCAGGAGGAT CACTGGAGCC CAAAAGGTTG AGGCTGCAGT GAACTGTGAT 
14251 CACACCACTG TACTCCAGCC TGGGTGACCA AGGGAGAGCC TGTTTCAAAA 
14301 AGAAGGCACA GCTTACCCCT GCAATCCCAG CACTTTGGGA AGTCGAGGCA 
14351 GGCAGATCCC TTGAGGTCAG GAGTTCAAGA CCAGCCTGGC CGACATGGTG 
14401 AAACCCTGTC TCTACAAAAA TACAAAAGTT AGCTGGGCGT GGTGGCTCAG 
14451 TGCCTGTAAT CCCAGCTACT TGGGAGACTG AGGCAGGAGA ATTGGTTGAA 
14501 ACCTGGAGGC GGAGGTTGCA GTGAGCCAAG ATCACGCCAT TGCATTCCAG 
14551 CCTGGGCGAC AGAGTCAGAC TCCGTCTTAA AAAAAAAAAA AAAAAGGCAC 
14601 AGAGAG GTTA AAATACATGC TCTACACAGC AAGCTAGTGG ACGAGTTTGC 
14651 ATCTGAGTTT GAGACTTTCT GACAATAGCC TTCCCTGAAC CAGGAAGTCG 
14701 TATCACCTCT TTCCAAAAAA AAGAGGTCAG ATTAATCTTA TCCTAATACA 
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14751 TCTTAAAAAT CATAAAGCTC TATTTTCTTC TCTGGCCTTT GAGTACCCGG 
14801 CTTCAAACCC CTGCCCTGCC ATTTACCAAA GGTGTGACAA ATTCTTCTTl 
14851 GCCTCCCTTT CCTTAATTGT AAAAGGTGGA TAAATAATAG TACCTCCCTC 
14901 ACTGGACTCA CAGTAACTCA GTGGTGAGTT ACTGAGTAAA TCCACACTAG 
14951 CTGCTTAGTG AACATTACTG TTGCTGTTAC ATCCTTAAAA ACACTCAGGG 
15001 CCAGGCGTCG TGGCTCACAC CTGTAATGCC AGCACTTTQG GAGGCCAAQG 
15051 CGGGCAGATC ACTTGAGGTC AGGAGTTTGA GACCAGCCTG GCCAACATGG 
15101 TGAAAGCCOG TCTCTATTAA AAATACAAAA ATTAGCCGGG CATGGTGGCA 
15151 CATGCCTGTA ATCCCAGCTA CTCAGAAGGC TGAGGCAGGA GAATCACTTG 
15201 AACCCAGGAG GCGGAGGTTG TGGTCAGCTG AGATTGCGCC ATTGTACTCC 
15251 AGCTTQGGCA ACAGAGTAAG ACTGTCTCAA AAAAAAAAAA AAATTTAAGA 
15301 GAGCTCTCOG TTTTACAAAT GAGGAAAGTG AGCXTCAGAG AGGGACAGGG 
15351 ACTCACCCAA GGTCACACAG CCAGTCTTGG ATTCAAACTT GAGAGT7TGT 
15401 AACCCTTTCT AATGATCAGG ACCTCCCAGA GTTGCCCAAA CTTCTGACCC 
15451 AGACTCTTCA GAGGCCTGCG CGGAGGAAGA GGAAGGGCTC ATGCCTCAAG 
15501 GTCCCCAGCT CCTGGACTTC AGCGTGGACG AGGTGGCGGA GCAGCTCACC 
15551 CTCATAGACT TGGTGAGGAT CCCGGACAGG GTGGGGATCA GCCACAGTGA 
15601 GGGGACAGGT TCTGCTAAGC ACCAATCCCA CACCCCTCCC CTGGCCCAGG 
15651 AGCTCTTCTC CAAGGTGAGG CTCTACGAGT GCTTGGGCTC GGTGTGGTCG 
15701 CAGAGGGACC GGCGGGGGGC TGCAGGCGCC TCCCCCACTG TGCGCGCCAC 
15751 CGTGGCCCAG TTCAACACCG TGACCGGCTG TCTGCTGGGT TCCGTGCTCG 
15801 GAGCACCGGG CTTQGCCGCC CCGCAGAGGG CGCAGCGGCT GGAGAAGTGG 
15851 ATCCGCATGG CCCAGGTGTG TTGCGGGGGC GGAGAGGGGA TGCGGGGGCG 
15901 GGCCCTGGGG CAAGGGGAAA AAATGAGGGC TCGGGAGAGA GATAGGGGCG 
15951 AGTCTAGGGG AGGGAGGGAA CGGGGTGGAA AGTTGATACC TAGGGTGAGA 
16001 CTTGGGTTCA GGGAGGAGGG TCTGGGTCCT GCAGAGAGGC CGCGGGCACG 
16051 ACTAGGTCCC AAGGGAGCTG GGAGAAGTAG GGAGCCCGGA CCGGAGAAGT 
16101 CAAGGTCGGA GGCAGGGGCT GGAGGGGCAG CTGGGGAGGG GCTGGAGCCC 
16151 GAGGGAGGAG GGAGGAAGGG AATCCTAGGG AATAAGTGGG AGTCTTGGTA 
16201 GCTTGTCGGA TGTGAGACAA CACCCAGGGG TCGGACCTCG GGTCACAAGT 
16251 CACGGGATCA GGCTGGGCGC AGTGACTCAC GCCTGTAATC CCAGCACTTT 
16301 GGGGAGAGGG AGGATGGCTT GAGCCCTTGA GTTTGAGACC AGCCTAGGCA 
16351 ACATAG7GAG ACCAATGTTT CTAGAAAAAA AAAAAAAATT AAAAAAATTA 
16401 AAAATGAGAC TTACAAAAAA ATTAGCCGGG TGTGGTGGTG TGCCCCTGTA 
16451 ATTCCCAGCT ACTTGGGAGG CTGAGGCAGG ATAATCACTT GAACCCGGGA 
16501 GGCGGAGGTT GCAGTGACTC GAGATCGGGC CACTGCATTC CAGCCTGGGG 
16551 GATAAAGCGA GACTCTGTCC AAATAATTAA TAATAATAAT AATAATAAGC 
16601 CATGCATGAT GGCGCGCGCC TGTAGTCCCA GCTACTGAGG CAGGAGGCTG 
16651 AGGCATGTGG ATCGCTTAAG CCCAGGAATT CCAGGCAGCA AGTGAGCTAT 
16701 GATCGAGCCC CTGCACTCCA GCCTGGGCCA CAGACCCTAT TTTTCAAAAA 
16751 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAGAT GAAGAAGTTT 
16801 CAGGATGAAA GGTGGATAAT GCCTGGGTCT GACCTGCGTC CCCACCGCCT 
16851 GGCAGCGCTG CCGAGAACTG CGGAACTTCT CCTCCTTGCG CGCCATCCTG 
16901 TCCGCCCTGC AATCTMCCC CATCTACCGG CTCAAGCGCA GCTGGGGGGC 
16951 AGTGAGCCGG TGAGCTGGGG CGGGACCTGT TCCCCAGCCC ATCCCAGGTC 
17001 TGACCCTCCC AAGCCACTGA CCCCTGACCA CCCTTCTCCT GTCCTTCCAG 
17051 GGAACCGCTA TCTACTTTCA GGAAACTTTC GCAGATTTTC TCCGATGAGA 
17101 ACAACCACCT CAGCAGCAGA GAGATTCTTT TCCAGGTAGA GATGGATGCA 
17151 GACTCCAGGG ATTTTAGGCC CGGGAAGTCG GGGGAGGGAC TTGGGGCCAG 
17201 GCAGGGGTAA TCTCCCTGCT ATAGTCAGGA CACTCTGTGC TTCCCTACCG 
17251 CTCAGCAATG ACCTTATCCT TGTCCCTGGC GGGTTGCACG TTTTTCTTTC 
17301 CTCTACTTCC TGCGTTATAG TTGACTGTCA GTGACTGCCC TATTTATTCA 
17351 CTCAGCAAAA CACAAGAAGT CACAAAGAAA AGGTTACTTA AGGCCAGAGT 
17401 CATAGCACAG GGTGGGAACA AAAAAAATGT TCTGAGGACT TTACCTTGAT 
17451 AAGCAAAACr AAAAAATGTG TGTCAAAAGT CTGGCTTATT TATAGGCAAG 
17501 ATTTAGATTC TCATTGCAAT CAGGCGCTGG TTTTTAGAGT GAATCTAGAA 
17551 TGGATCCCTG GGCCTGGAAC ATTCTCCACC CCTCCAGGTT TGCATGCAAC 
17601 TTGCTCACTC ACCTCCTTCT GGTCTCTGAT TAAATGTCCC TGCCTCTGAG 
17651 AGGCCTTCCC AGCCTCCATC ATCCCCAAAA CCACACATCT GGI I I I I IGT 
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17701 TCTTCTTGTT GTTGTCGTCA TCATTTGTTT TTTTGTTTCT TTGI I IGI I I 
17751 GTATTGAGAC AGAGTCTCGC TCTGTCACCC AGACTGGAGT GCAGTGGCAC 
17801 GATCTTGGCT CACTCCAACC TCCACCTCCC AGGATCAAGC AATTCTCTCT 
17851 GCCTCAGCCT CCCATGTAGC TGGGATTACA GGCACCCACC AOGACGCTTG 
17901 GCTAATTTTT GTATTTGGTA GAGACGGGGT TTTGCCGTGT TGGCCAGGCT 
17951 GGTCTCGAAT CCCTCACCTG AQGTCATCCA CCTCCCTTCG CTTCCCAAAG 
. 18001 TGCTGGGATT ACAGGCGTGA GCCATCACTC CCAGCCAAAT TTCACCTGGC 
18051 TAACAGAGTG AAACCCTGTT TTCCTGCCCA GCACCTAGAA CAGCACGTGA 
18101 GCTGGGCTCA GTGACTCACG COATAATCC CAACACTTTC GGAGGCCAAG 
18151 GTGAGAGGAT CACTTGAGCC CAQGAGTTCA AGACCAACCT GGGCAACATG 
18201 GCAAAACCCC ATCTCTGCAA AAAATACAAC AATTAGCPGG QCGTTTGTGG 
18251 TGCACGTCTC TAGTCCCAGC TATTCAGGAG GCTGAGGAGG GAGGATGGCT 
18301 TGAACTTGGG CGGTCAAGCC TTCAGTGAGC CAAGATCAGG CCACTGCACT 
18351 CCAGCCTGAG TGACAAAGTG AGACTCCATC TCAAAATAAA ATGAAATAAA 
18401 AAGTAAGTAA ACAACAGCAA ATTCAGGATA CCCAGGAGAT CCCTGGCAGG 
18451 CCTGTGCCAT CCAGCTGCGG ACAAGGATTC TCTCCTTGTT AAGGCCAGCC 
18501 CTGGGGGCCA CTACCCACAA GCCCCACCTC TCATGGGGCC TGCTCCCTGC 
18551 TGTTTATCTC CTCCCTACCC TCATCCAAGG TGGTCTGGCT TCTAGAGTGG 
18601 GCCTTAACCC CTGGLI ILI I I I I I I I 1 1 1 1 I I 1 1 I I I II I TGAGATGGAG 
18651 TTTTGGTCTT GTTCCCCAAG CTGGAGTGCA ATGGTGCGAT CTTGGCTCAC 
18701 TCCAACCTCC GCCTCCCGGG TTCAAGCGAT TCTTCTGCCT CAGCCTCCCG 
18751 AGTAGCTGGG ATTACAGAAA TATGCTACCA TGCCCAGCTA Gl I I I I I ATA 
18801 TTCTTAGTAG AAACAGAGTT TCACTCTGTT AGCCAGGCTG GTCTCAAACT 
18851 CCTTACCTCA TGTGATCCAC CAGCCTCGGC CTCCCAAGTG CTGGGATTAC 
18901 AGGCGTGAGC CATGGCACCT GGCCTACCAC TGALI I I IGA TTACTCAAAG 
18951 CATGAAGGGT ATATATGATG GGTCTGCAGG CATCGTTCCT GAGGAATTGT 
19001 CCAAGGAGAC CCCAGACCTG GCTCAGTTTT TCTCTTCCCT CAGGAGGAGG 
19051 CCACTGAGGG ATCCCAAGAA GAGGACAACA CCCCAGGCAG CCTGCCCTCA 
19101 GTGAGTGATT ACAGTTTGGG ATGGGGACAA GTGGGACCTT CAGGGAGGGT 
19151 TGTGGATGGT GATGGGGTCA GTAATGGCCC CAAGTGACTG GAGCTTTGGG 
19201 GGCTGCAGAA ACCACCCCCA GGCCCTGTCC CCTACCTTGG CACCTTCCTT 
19251 ACGGACCTGG TTATGCTGGA CACAGCCCTG CCGGATATGT TGGAGGTCTG 
19301 ACCCCTGACC CTTGACCCCT GACCCCAGCT CCAOTGCCC CCAGCACAAT 
19351 GGGCCTCCCA ATATCCACCC TTGATCCTAC CTGTACTCCT GACACCACCC 
19401 CACACTCCCT TACTACAGTG GGGCTCCTGA CATCCCAGCC CCTGACCTTG 
19451 ACCCTTGACC CTTGACCCTG GGTGCTGCAA TTCAGACACA CTTTGCCCCC 
19501 AGGGGGATCT CATTMCTTT GAGAAGAGGA GGAAGGTGAG TGGAGGCTAC 
19551 AGTGGGTGTC GTGGTGCCTG AGGGTGGGGG TGGGGCAGGG GTAGGGTCTT 
19601 AGAGGCTCGT CCTCCAGGAG TGGGAGATCC TGGCCCGCAT CCAGCAGCTG 
19651 CAGAGGCGCT GTCAGAGCTA CACCCTGAGC CCCCACCCGC CCATCCTGGC 
19701 TGCCCTGCAT GCCCAGAACC AGCTCACCGA GGAGCAGAGG TGACCACCCT 
19751 GTAGCCTGTC CCAGCCCCAC CCCAGCTGAG CCTGGGTCAC CAACTGGATT 
19801 CCACCCACTC CATACACACC TCCAGCTCCT CCCAAGACCC CCTCTTGAGC 
19851 CCTGATCCCC CACTACAACC TGTGACCTTG CAGTATCTCC AGTCGAATCA 
19901 AATAGACTGG GCCTGGTGGT TTACTCGTGT AATCCCAGCA CTTGGGAGGC 
19951 CAAGGTGGGT GGATCACTTG AGCCCAGGAT TTCGAGACCA GCCTGGGCAA 
20001 CATGGCGAAA CCCCATATCT AGAAAAAAAT ACAAAAATTA GCTGAAOGTG 
20051 GCTGGGCAGG GTGGCTCACA CCTGTAATCC CAGCACTTTG GGAGGCCGAG 
20101 GCGGGTGAAT CACATGAGGT AAGGAGTTTG AGACCAGCCT GGCTAACAGA 
20151 GTGAAACCCC GTCTCTACTA AAAATACAAA AAAAAAATTA GCCAGGTGTA 
20201 GTGGCAGGGG CCTGTAGTCC CAGCTACTTG GGAGGCTGAG GCAGGAGAAT 
20251 GGGGTGAACC CGGGAGGCAG AGCTTGCAGG GAGCCGAGAT GGTGCCACTG 
20301 CACTCCAGCC TGGGCAACAG AGCGAGACTC CGTCTCAAAA AAAAAAAAAA 
20351 AAAATTAGCT GAATGTGGTG TTGAGTGCCG TTGGTCCCAA CTACTTGGGA 
20401 GGCTGAGGTG GGAGGATTGC TGGAGCCTGG GAGGCAGAGG TTGGAGTGAG 
20451 CCAAAATCAC GCCACTGCAG TTCCAGTCTA GGTGACAGAG TGAGACCCTG 
20501 TCTCAAAAAA AAAAAAAAAA AAATAGTCAC AATTGACCTC TGACCTCAAT 
20551 TTCAACCCCA TCTGATTTTC TGACCTCAAC TTTAGCATTC AGCTGGCCAT 
20601 TCAACTCAAC TGTCCCATCT GTTGACTTCC CCATCTTTGG TCCTATCTGA 
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20651 CCCATGACCT TATTCATGAC CCCTCATCTG ACTCTCTGAC CCCAACCCTT 
20701 GACCCTCAGT TCTGAGTAAC TGACTCCAAC TTTTATCTTT GACTGTCCAG 
20751 CTTGACTATC ACAACTGTGT CCTTTCTTTC TATATAACTG TGACCCTAAC 
20801 CATTGACCCC AATGGTGACC TGACCCCAGT CTGACCCTCA CTTTATTTTA 
20851 TTTATTTATT TATTTATTTA TTTATTTATT TATTTATTTA 1 1 1 I IGAGAC 
20901 AGAGTCTGGC TCTGTTTCCC AGGCTGGAGT GCAGTQGAGT GATCTCGGCT 
20951 CACTGTAGCC CCCGCCTCCC AGGTTCAAGC AATTCTACTG CCTCAGCCTC 
21001 CCCGGTAGCC GCAATTACAG GCGCGAGCTA CCACACCTGG CTAAI 1 1 1 1 I 
21051 TTGTAI 1 1 1 1 AGTAGAGACG GGGTTTCACT ATATTGGCCA GGCTGGTTTC 
21101 GAACTCTTGA CCCGAAGCAA TCCTCTCGCC TCAGCCTCCC AAAGTGCTAG 
21151 GATTACAGGC GTCAGCCACT GCACCCAGTC CTGACCCTGA CTTTAATCCT 
21201 GACCCAATTT GATTCCTTAG TGCCACCCTG TGAATCTCTT TCTGACCTCC 
21251 TGACCAGCCA TCCTGTCCCA TCTCTGATAA GACCTTGATG CTCAATGACC 
21301 CTCATTTACC ACCCTGACCC TGGCATGTCG GGTGCCACCT CTGGCTGCTC 
21351 CCCCTTACAC CCCAAACCCA CCTCCCAACT GATTCCAACT CTTATCTCTC 
21401 CATCCCCTGT ATTTCCTGCC CCCACCACCT CATCCACATA TTCACCCCTC 
21451 AGCTACCGGC TCTCCCGGGT CATTGAGCCA CCAGCTGCCT CCTGCCCCAG 
21501 CTCCCCAGGC ATCGGACGGC GGATCAGCCT CACCAAGCGT CTCAGTGCGT 
21551 GAGTCTGGGG GTGTGTGTAG GGGCGGTGAT GTGGGCAGAT ATCAGCAAGG 
21601 GCTCCTCCre CCTTAGCCTC ATCCCCTGTC cccatcctta ggaagcttgc 

21651 ccgagagaaa agctcatcac ctagtgggag tcccggggac ccctcatccc 
21701 ccacctccag gtgagcattc tgcttggtca tgggactggg gatcatggga 
21751 tcaggagtca gcacagccac cccacctcag cctctgcatc tcccccag7g 
21801 tgtccccagg gtcacccccc tcaagtccta gaagcagaga tgctcctcct 
21851 ggcagtcccc cggcctctcc agggccccag ggccccagca ccaaggtacc 
21901 aagacggctt gtgtgtgcat gcgggcctcc gggcacccag gctctgtgtg 
21951 tgtgcacgtg tgtgtgcatg cacatgtgta cacacaggat tgtggggcca 
22001 ggagtgtata caggaggcac actgagcgcc cggggtatcc atccagggga 
22051 ttgcatgcat ctgcacggcc ctgtttgggt gatcactcat aaatccgact 
22101 cgtgctcaga tttggacctg tgtaactgct tgcccatggg tcatctaggg 
22151 tgcaatcaca tcacacccct ttttatttca aacagggtct tcttgctc7g 
22201 tcacccaggc tgaagtgcag cggtgcaatc tcagctcacc gcaacttcca 
22251 cccctccccc aggctcaagc aatccttcca cctcagcctc ccaagtagct 
22301 aggaccacag gtgtgcacca ccatgccctg ctai 1 1 1 1 1 1 tatttagtag 
22351 agatgaggtt tcgccatgtt gcccaggtgg gtttcgaact cctgagctca 
22401 aacaatgcac tcacctcggc ctcccaaagt gctgggatta caggtgtgag 
22451 ccaccgcacc cagcctacac i i i i i pgagg acatgtatgt ccctaagaat 
22501 ctgcatacca tggcagacac ggtggctatt gcctgtgatc ccggcacttt 
22551 gggaagccaa agtgggagga ttgcttgagg ccgggagttc aagaccagcc 
22601 tgggcaacat agtgagaccc tatttctatt aaaagtcaaa aaaattagct 
22651 gggtgtagtc ccagctactc agcaggctga ggtggaagga tcgcttgagt 
22701 ttgaggctgc agtgagctac gatcatgcca cggcactcta gcctgcatga 
22751 tagagcgaga tcctgtttat gaagaaaaag agactgggca cggtggctca 
22801 cgcctgtaat cccagcactc tgggaggccg aggtgggcgg atcacgaggt 
22851 caagagatcg agaccatcct ggccaacgtg gtgaaaccct gtctctactg 
22901 aaaatagaaa aattagctgg gtatggtcgc gcacacctgt agtcccagct 
22951 acttgggagg ctgaggcagg agaatcactt gaacccagga gacggaggtt 
23001 gcagtgagct gagatggtgc cactgcactc catccagcct ggtgacagag 

23051 CGAGGCTCCC TCTAAAAGAA AAACAAAAAA AGAAAAGGAA ATGAAGGAAA 
23101 TGAAGGCTGG GCATGGTAGC TCATGCCTGT AATCCCAGCA CTTTGGGAGG 
23151 CCGAGGCCAG TGGATCACTT GAGGCCAGGA ATTTGAGACC AGCCTAGCCA 
23201 ACATGGTGAA ACCCCGTCTC TACTAAAAAT AAAAACATTA GCTGGGCATA 
23251 GTGGCACAGG CCTGTAATCC CAGCTACTTG GGAGGGTGAG GCATGAGAAT 
23301 TGCTTGAACC TGGGAGGCAG AGGTTGCAGT GAGCTGAGAT GGCACCACTG 
23351 CATTCCAGCC TGGGTGACAG AGCAAGACTC TGTCTCAAAA AAAAAAGAAA 
23401 AGAAAAAGAA AAGAATCTGT GTACCAGAAG AGGAAATGTG GGCCTGAGTA 
23451 TTCATGAGAT CATGTGTGGG GTTGTTCATT GGCATGGGCT GTGGGTGTAT 
23501 AACCGCTGTC AGCATATGTA TGTACACAGG ATTTCTTGTG TATGAGCATG 
23551 GGTTGTGTGT ATATGGACAC TGTTCATGTC TGTTTCTATA ACAGGTAACC 
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23601 AAAGTCTGTA TATGGTAGGG TGGTGTATAT GCAQQCTTGT GAATGTACTC 
23651 CAGTTGCA7G TCCCAGGCTC 7GCATGTGTA GGGGGTAGTA GTATGI 1 1 IC 
23701 TTGAGATTTT ATTTTATTTT ATTATTTATT TATTTATTTT TGAGATGGAG 
23751 TCTTCCTCTC TCAOGCAGGC TGGAGTGCAG TAGOGTGATC TTGGCTCACT 
23801 GCAACCTCTC CCTCTCAAGT TCAAGTGATT CTCCTGCCTC GGCCTCCCAA 
23851 GTAGCTGGGA TTACAGGCAT GCGCCACCAG GCCCTGCTAA I I 1 1 IG TATT 
23901 TTTAGTAGAG ACGGAGTTTC ACCACGTTGG CCAGGCTGGT CTTGAACTCC 
23951 CGACCTCAAG TGATCCGCCC ACCTCGGCCT CCCAAAATCC TGAGATTACA 
24001 GGCATGAGCC ACTGCGCCCA GCCAATGTTT TCTTGAGATT TTAAATGTGG 
24051 GGCTATTGAA TGCACCAGTG GTGGCTGGGG TGTTCGTGCT TTTCTAGCCC 
24101 TCAGCATCTC CAGATGGGCC AAGCTGTAGC CTCCACCCCT TACTGCCTGC 
24151 AGCTCCCCCT GAGCCTGGAC CTGCCCAGCC CCGGGCCCTT CGCTTTGCCT 
24201 CTGGGCAGCC CTCGAATCCC CCTCCOGGGG CAGCAGAGCT CGGAGGCCCG 
24251 TGTCATCCGC GTCAGCATCG ACAATGACCA CGGGAACCTC TATCGAAGCA 
24301 TCTTGGTGAG GGGCTGGGCT GGGGGTCTCC TGGAGGCTGC CCTGCCCTTG 
24351 GGGCCGGGGC CCTCACCTCA CCTCCCGCCC CTCTCTTCCA GCTGACCAGT 
24401 CAGGACAAAG CCCCCAGCGT GGTCCGGCGA GCCTTGCAGA AGCACAATGT 
24451 GCCCCAGCCC TGGGCCTGTG ACTATCAGCT CTTTCAAGTC CTTCCTGGGG 
24501 ACCGGGGTGA GCAGGGATGG GTTGGAGCTC AGGATAGGGG GCAGCGGGGA 
24551 GGCGAGCAGA CTGACCACGC CCAAGGATGG AGCCCAAGGT TACCCGGGTT 
24601 CACAGGGCTG TGAGGTGCTT CAGGCAGAGA GTAGGGGTAA GATAATCAGT 
24651 GGAGGTAAGA GGACATAAAA TACCTGTAAC CCAACGATGT AGGGTCATGA 
24701 GATTGTCTTG GCTCAGTGTG AGAGAGAGGT ACCAAAGGTC ATCTTCCTAA 
24751 AATTTAAAAG ACAATAAGAT TGTCCAGGGT CCGGCCAGGC GCAGTCGCTC 
24801 ATGTCTGTAA TCCCAGCACT TTGGGAGGTC AAGCTGGGCG GATCACTTCA 
24851 GGTCAGGAGT TTGAGACCAG GCTGACCAAC GTGGTGAAAC CCCGTCTCTA 
24901 CGAAACATAC AAAAATTAGT CGGGTGTGGT GGCACACTCC TGTAGTCCCA 
24951 GCTACTCAGG AGGCTGAGGC AGGAGAATAA TTGCTTGAAT CTGGGAGGCG 
25001 GAGGTTGCAG TGAGCCGAGA TCATACCACT GCACTTCAGC CTGGGCAGCA 
25051 GAGCGAGACT CTGTTTAAAA AAAAAAAAAA AAAAAAAGAC TGTCCAGGGA 
25101 CAAGTGACAG AAGGGAGTGT TTCTGACCTT CAATTTGTAG GATGGGCTGG 
25151 GGATGGTGGC TCACAACTGT AATCCCAGCA CTTTGGGAGG TCAAGGTCGG 
25201 TGGATTGTCT GAGCTCAGGA GTTTGAGACC AGCCTGGGCA ACATGAGGAG 
25251 ACCCCATCTA TACAAAAAAT AGAGAAATTG GCTGGGTGCG GTGGCTCAAC 
25301 GCCTGTAATC CCGGCACTTT GGGAGGCCAA AGGGGGTGGA TCACTTGAGG 
25351 TCAGGAGTTC GAGACCAGCC TGGCCAACAT GGTCAAGCCC CGTCTCTACT 
25401 AAAAATACAA AAAAATTAGC TGGGCATGGT GGCACATGCC TGTAGTCTCA 
25451 GCTACTCGGG AGGCTGAGGC AGAAGAATCG CTTGAACCCA GGAGGCGGAG 
25501 GTTGCAGTGA GCCGAGATCG CACCACTGCA CTCCAGCCTG GCGACTGAGC 
25551 AAGACTCTGT CTCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAGCCAAT 
25601 ATATATATAT ATATATAGAG AGAGAGAGAG AGAGAGAGAG AGAGAGAGAG 
25651 AGATTAGCTG AGCATGGTGG CATGTGCCTG TATTCCCAAC TCCAACTACT 
25701 GGGGAGGCTG AGGTGGGAGG ATCACTTGAG CCTAGGAGGT GGAGGCTGCA 
25751 GCGAGCTGAG ATCACGCCAC TGCACTCCAG CCTGGGTGAC AGAGCAAGAC 
25801 CCTGTCTCAA TTAAAAAAAA AAAAGGGGGC CGGGCATGGT GGCTCACGCC 
25851 TGTAATCCCA GCACTTTGGG AGGCCGAGGC GGGTGGATCA CGAAGTCAAG 
25901 AGATCGAGAC CATCCTGGCC AACATGGTGA AGCCTCGTCT CTACTAAAAA 
25951 TACAAAAAAT TAGCCAGGCA TGGTGGCAGG CGCCTATAGT CCCAGCTACT 
26001 CAGGAGGCTG AGGCAGAAGA ATCACTTGAA CCCAGGAGGT GAAGGTTGCA 
26051 GTGAGCCAAG ATTGCGCCAC TGCACTCCAG CCTGGCGACA GAGTGAGACT 
26101 CCGTCTCAAA AAAAAAGAAA AAATAGATTG TCTAGGGTCG AGTGAGAGAA 
26151 GGGAGTGTAG AAGTTTGTCT GATCTTAAGT TTGTAGCATC ATGAGATTGT 
26201 TCAGGCTCAA CCTGATGGGA TGGGAGACTA AAGGGCATCT GGGCTTAGAT 
26251 TTGTGAGAAC TAAGTTTGTT CACCACTGGG ACCCTGAAGT TATCTGAACT 
26301 TGGGACGGGA GAGAGGCAAA TGGATAGCCG CGGAAGCATG AGATTGTCCT 
26351 GTCTGACAGG GAGAAGCAAG GGATTGAGCG TATTCACGCT GAAGTACATG 
26401 GCATGAGGTT GGCTGGATAT TAGGAAAGGA TGCTTGTGGT TGTTCAGGTG 
26451 TTGAGTGTGA GGCCACAAGC TCGTGCAGGC TGGAAGTGGG AAGTTATTCA 
26501 AGTTCATGGT GACAGCAGCA TGGGATTGGC TGGGAGTGGT TGTGGGGGAG 
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26551 GGGTAGGGTG AGCAGGAAGT TGTTTGGCGG GGGGTGGTCT AGGGTGGTCT 
26601 AAGTTTGCCC AAACTTTTAC TGCAGGTTGT CGGI 1 1 IGI I TGI I I I IGI I 
26651 I I I I I 1 1 I 1 1 TGAGATGGAG TCTCGTGCTG TTGCCCAGGC TGGAGTGCAA 
26701 TGGCAGGATC TCGGCTCACC GCAACATCCA CCTCCTGGGT TCAAGAGATT 
26751 CTCCTGCCTC AGCCTTCGGA GTAGCTGGGA TTACAGGCAT CTGCCACCAT 
26801 GCCTAGCTCA I I I 1 1 I IQGT ANN I ACTA AAGACGGGGT TTCACCATGT 
26851 TGGCCAGACT GGTCTCGAAT TCCTGACCTC AACTGATCCA CCCACTTCGG 
26901 CCTCCCAAAG CGCTGGGATT ACAGGCAGGA GGCATCGCGC CCGGCCAGTT 
26951 TGCTCAAACT TTTACTGCAG GTTGCCTTGT CTCTATGGTG AGGGGGAGAA 
27001 T ATTAGG AGG TTGCCCAGGC TTATGATAAG GGAAGGCATG AGGTGGTGCA 
27051 AGTTTTCAAG TGAGAAGTCG TCCAGGTTCC CAGTGACAGC AGAATGAGCT 
27101 TGGCTTGGCA GTAGCTGCAG AGGGACCCAT GGCTGTTCAG GTTCGCGGGT 
27151 GAGTGGCAGG AGGCTCCCGG GTCCTCTGTG GGGGTGACAC AAGGTTGTGA 
27201 GGGCCTATTA CCACCATCTC CACTCCTGAC CAGTGCTCCT GATTCCTGAC 
27251 AATGCCAACG TCTTCTATGC CATGAGTCCA GTCGCCCCCA GAGACTTCAT 
27301 GCTGCGGCGG AAAGAGGGGA CCCGGAACAC TCTGTCTGTC TCCCCAAGCT 
27351 GAGGCAGCCC TGTCCTCTCC ACAAGACACA AGTCCCACAG GCAAGCTTGC 
27401 GACTCTTCTC CTGGAAAGCT GCCATCCCCC AGTAGAGGCC ACTGTGCTGT 
27451 GTATCCCAGG ACCACCACCC AACTGTAGCC CATTGGACCC CATCTCTTTT 
27501 TCTGACTCTG TTGGTACTAG ATCCATATTC CAAAGACATC AGCCCATGGG 
27551 TGGCTGGTGG AGAGCTCAAT CCCATAAATG TAGAAAGAGG TGGGGCATGG 
27601 ATACGTCAAA TCCCTCCCCA GAGAAATCTT ATAAATGTTA GAGACGCATC 
27651 AGAAGTGACA GATGCGGATG AAAATAGTGA CCAGAGTTAT GAAACAGCTG 
27701 TCAGTCTTGT TTATTTTGCG CCTGTGTGCC ATGTTCACCC TTTATCAAGA 
27751 TAAAGGAAAA CAGCTACCAC ACACACACCC ACACACACAC ACACAAACAC 
27801 ACAGAGAGAG AGAAACCTAA GAGCCAAGAC CAGCCCGGGC AACATAACGA 
27851 GATCCTGTCT CTACAAAAAA TACAAAAATT TGGCTGGGCG TGGTGACTCA 
27901 CGCCTGTAAT CCCAGCGTTT TGGGAGGCCA AGGCAGGCAG ATCGCCTGAG 
27951 GTCAGGAGTT CGAGACCAGC CTGGCCGACA TGGCAAAACC CCATCTTCTA 
28001 AAAGTACAAA AAATTAGCCG GGCGTGGTGT CATGCACCTG TAATCCCAGC 
28051 TACTGGGGAG GCTGAGGCAG GAGAATTGCT TGAACCCGGA AGGTGGAGGT 
28101 TGCAGTGAGT GGAAATCACA CCACTGTACT CCAGCCTGGG TGACAGAGCA 
28151 AGACCCTATC TCAAAAACAA ACAAACAAAC AAATGAACAA ACAAAAAATT 
28201 TTCTGAGTGT GGTGATATGA GACTGTAATC CTACCTACTT GGGAGGCTGA 
28251 GCTGGGAGAA TCACCAGAGC CCTGGGAGGT TGAAGCTGCA GTGAGCAGTG 
28301 ACTGGGCCCC TGCACTCCAA CCTGGAGGAC AGAGGGAGAT CCTGTCTCAA 
28351 AAACAAAAAA ACTAAGAGCC CTAAGAAAGG TGTTGAGTCG GGTATGACAC 
28401 TCAACCCAGA TGCCAGAGAG GATCCTGTCT GGCCGGACAC AGTGGCTCAG 
28451 GAGGGTAATC CCAGTACTTT GGGAGGCTGA GGTAAGAGGA TTGCTTAAGT 
28501 TCAGGAGTCC GAGAGCAGCC TGGGCAATAC AGTGAGATCA CATCTCACTA 
28551 AATAAATAAA TAAAGGATCC TATCACACAA AGAGGGTTTA GGACTTCCTT 
28601 CCCCAACATT TTTGGGGTGA TATGCCTCTT TTCTACTGTA TATATGGGAG 
28651 AGTGACTAAC TGAAATTCCA TCAGAATTAG AAACAAATAG CATCATTACC 
28701 CATGAGTCAA TAAGGGCTGT GAGGATGGGC CCTTTCACTT GCCCTCACCT 
28751 TCTTCCTCTT CCTGTCACAG ATAACCCATC TGTGCAAAGA AGAGAAAAAG 
28801 AGGTTGGGTG TGGTGG CTCA CATCTGTAAT CCCAGCACTT TGGGAGGCTA 
28851 AGGTGGAAGG ATTTTGAGCC CAGGAGTTTG AGACCAGCCT GGGCAACATA 
28901 GTGAGACCCC ATTTCTACAA AAAAATACAA AGATTGGCCA GGCGCGGTGG 
28951 CTCACGCCTG TAATCCCAGC ACTTGGGAGG CTGAGGCAGG CGGATCATGA 
29001 GCTCAGGAGA TCGAGACCAT CCTGGCTAGG TGAAACCCCG TCTCTACTAA 
29051 AAATGCAAAA AAATTAGCCG GGCGTGGTGG CGGGCGCCTG TAGTCCCAGC 
29101 TACTCGGGAG GCTGAGGCAG GAGAATGGCG TGAACCCAGG AGGCGGAGCT 
29151 TGCAGCGAGC CGAGATCGCA CCACTGCACT CCAGTCTGGG CGACAGAGGG 
29201 AGACTCCATC TCAAAAAATA AAATAAAATT TAGCCAGGTT TGGTGTCCTG 
29251 CACCTGTAGT CTCAGCTACT CTGGAGGCTG AAGCACGAGG ATCACTTGAG 
29301 CCCAAGAGGT GGAGGTTGCA GTGAGCCGAG ATTACTGCAC TCCAGCCTGG 
29351 GTGACAGAGC GAGATCCTGT TTCAAAAAGC AAAAAAAAGG GCCAGGCGCA 
29401 GTGCTCACAC CTGTAATCCC ATCATTTTGG GAGGCTGAGG TGGGCGGATC 
29451 ACTTGAGGTC AGGAGTTCAA GGTCAGCCTG GCCAACATGG TAAAACCCTG 
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29501 TCCCTACTAA AAAATATAAA AATTAGCTGG GCA7GGTGGT GGGTGCCTGT 
29551 AATCCCAGTT ACTCAGGAGG CTGAGGCAGG AGAATTGCTT GAATCCAGAA 
29601 GGTGGAGGTT GCAGTGAACC GAGATCATGC CATTGCACTC CAGCCTGCGT 
2%51 GACAAAGTGA GACTGTATCT CAAAAAAAAA AAAAAAAATG CTGGGCACAG 
29701 TGGCTCTAGC ACTTTGGGGG GGCAAGACGG GTGGATTGCT TGAGGCCAGG 
29751 ATTCCAAAAC CAGCCTGGCC AACATGGTGA AACCCCTTCT CTACTAAATA 
29801 TACAAAAAAT TAGCCGGGCA TGGTGGCAGG CTCTTGTAAT CCCAGCTACT 
29851 CGGTAGGCTG AGGCAGGATA ATCACCTGAA CCAGGCAGGC AGAGGTTGCA 
29901 GTGAGTCGAG ATCGCTCCAC TGCACTCGGG CCTQGGCAAC AAGAGCAAAA 
29951 TTCTGTCTGG AAAAAAAAAA AAAAGAAAAA GAAAAGGATT GTGAGGATGA 
30001 AAAGAGAGGC GTGAGCTCTC TGTCAGCGTT GGAGTACAAT AGAGAGGATG 
30051 AAATGAGCTG TAGGGCGAAC TGCTACATAG TCACAACCAC AATAATATGC 
30101 CCACTTATGA GCTCCTACTC AGCAGAGAAC ATCAGCTATG GTCTTTACAT 
30151 CTCATTGCAC TAATCGAGTT CTTTCTGTTG CAAGCGACCA AAAACCCAAT 
30201 TCAAAGAGGC ATGTCCAAAA AAGGACATTT GTGGCTTATG CAGTTGAAAT 
30251 GTCCAATGAG TAGGGCTTCA GGCACAGTTG CATCCAGGCA CTCATAAGAT 
30301 GTCATCAGGG TTTTCTTGCT GTCTCTTTGC TCTGATTTGC TCTGAGAATG (SEQ ID NO: 3) 



FEATURES: 

Start: 3000 

Exon: 3000-3032 

intron: 3033-3493 

Exon: 3494-3607 

intron: 3608-5219 
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Exon: 19503-19535 

intron: 19536-19617 

Exon: 19618-19739 

Intron: 19740-21452 

Exon: 21453-21548 

intron: 21549-21641 

Exon: 21642-21710 

intron: 21711-21798 

Exon: 21799-21895 

intron: 21896-24152 

Exon: 24153-24305 

intron: 24306-24391 

Exon: 24392-24506 

Intron: 24507-27233 

Exon: 27234-27349 

Stop: 27350 
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ALLELIC VARIANTS (SNPs) : 
DNA 



Posi ti on 


Maior 


Minor 


Domain 

L/VJIIICl III 


1695 


A 


C G 


Bevond ORFfS 1 ^ 


4607 


T 


c 


Tntrnn 

—■ 1 CI V/l 1 


5133 


G 


c 

V— 


Tntrnn 

XI 1 LI vi 1 


6190 


A 


c 


Fvnn 


6196 


T 


r 


Fxnn 


7403 


T 


G C 


Tntmn 

Jul 1 CI V/l 1 


9981 


c 


T 
■ 


Tn1~mn 

— 1 1 LI V/l 1 


9998 




T 


Tnlrnn 
xi i ci m i 


11050 


T 


r 

V- 


Tnlrnn 

XI 1 LI V/l 1 


U772 


T 


A G 

VJ 


Tnlrnn 

XI 1 LI V/l 1 


12797 


T 




Tntrnn 

XI 1 LI v/l 1 


12801 


T 




Tntrnn 

XI 1 LI V/l 1 


12857 


T 


c 


Tnlrnn 

— 1 1 LI V/l 1 


17525 


r 

V_ 


T 


Tnlrnn 

XI 1 LI V/l 1 


17552 


G 


A 


Tnlrnn 

XI 1 CI V/l 1 


17572 


T 


G 


Tntrnn 

— 1 1 CI V/l 1 


18846 


G 


A 


Intrnn 

— • 1 CI V/l 1 


18871 


T 


c 


intron 


18872 


c 


A 


Intron 


19600 


T 


c 


Intron 


21013 


A 


G 

VJ 


Tnlrnn 

— 1 1 LI V/l 1 


22055 


A 


G 


Tnlrnn 

— 1 1 CI V/l 1 


22112 


T 


r 

v_ 


Tnlrnn 

XI 1 LI Lll 1 


22113 


j 


r 


Tnlrnn 

XI 1 LI V/l 1 


22221 


c 


T 


Tnlrnn 

XI 1 LI V/l 1 


22360 


T 


r 

V_ 


TntTYVI 
XI 1 LI v/l 1 


22543 


G 

VJ 


A 


Tntmn 

XI 1 LI v/l 1 


23751 


A 


T 


Tntrnn 

XI 1 LI V/l 1 


23764 


r 


T 


Tntrnn 

XI 1 LI V/l 1 


23782 


G 


A 


Tntrnn 

— 1 1 CI V/l 1 


24593 


C 


T 


Intron 


24673 


C 


T 


intron 


24730 


T 


C 


Intron 


24750 


A 


G 


intron 


24764 


A 


G 


Intron 


24780 


A 


T 


intron 


24856 


A 


G 


Intron 


26136 


A 


G 


intron 


26331 


C 


G 


intron 


28362 


A 


G C 


Beyond 0RF(3 f ) 


28745 


T 


C 


Beyond 0RF(3') 



Protein 

Position Maior 



Minor 



162 
164 



H 

V 



P 
A 



Context: 



DNA 

Position 
1695 



ATCACTCAAGATCCGTTCCCAGAAATTMTC 

GAGGTCCTAGaiCTCGTCCTGACACCCACCAGGCGGCCCCTTCGGAGCCCCCG 

GGTCT AGCGCACGCCACGGGCGCTTCAGGGGCTGAACGCGGCCGGACCGGGAACGGAGGC 

GGCO^GGCGCCCGGAGGGGAGGAAGGGGCGGGGCCAGACGTTCCCA^ 

CCAGCTGGGTCCGCCCGGGCCT^^ 

[A,C,G] 

TATTCATCIGI I I I I GACCCTCCACCACCCAAGAGACGGCTGTCCCTGAAAACCCAGGGC 
CACAGACTGCCTCCTG^ACCTGGGTCATGATCACT^^ 
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AGCTTAAAMTCGGAGATGGCCCGATCTACnXKJrrTTATGCCT 
GGAGGCTXaAGnXaGGAGGATCACrnGAGTCCAGGAGCTCCAGG 
CGTCCTACTGCACTCCA<XCTX2^^ 

4607 CECAGCAGTCCTOXA^ 

T TCGAGGC TXXAOTGAGCTATCA^ 

ACCTCTTTCT AAAAAAMCAMCOW\ACAATAACTCCAAGAAGCC!GQGAQ\CAGAGGAA 
TCACATGAMGAATGGTGCTAG^G^^ AATCCCAGCAC 
TTTGGGAGGCCGAGGCAAGTGGATCATCAGGTCAGG^GTT GV\GACCAACCTGGCCAAGA 
[T,C] 

GGTGAAACACCGTCrCT ACTAAAMTACGAAAAMCTAGCTGGGCTrcGTIGGCGGGTGCC 

TGTMTCCCAGATACnXXSGAGGCn^ 

TTCGACTGAGCGAAGATCA(^C^^ 

CTCAAAAAAAAAAAAAAAAAACTGGTCCTA 

TCCTAGCCCTTTGGC^GACm^ 

5133 GGTTCACGCCTCT AATCCTAGCCCTrrGGGAGACTTTGGGAGGCCA'VGGGGGGOXGATTA 
CrrcAGGrCAGGAGTTCGAGACCAGTCTGACCMCATQGTGAMCCCTATCT 
ATACAAAAATTAGCTGGGL I I <Jb I UJTGTGCGCCrGTAGTTTCAGCTACTTGG^GGCTCA 
GGMGGAGGATTCCTTGMCCCAC^ 

GCACTCCAGCCTGGGCMCAGAGreAGACTCTCTCTCAAAAAAAAAM 
[G,C] 

AGTGGTGCT AGrGATGMTGTX^CrAGAGAAGQQGTGCrGTGAGGACCACrCCTCCrCTC 

TCATGGCCACCrCTCCCCTCCTGCAGGCrCCCAGCCCCATTGCCMTACCTTCCrCCACr 

ATCGAACCAGCMGCTGAGGCTG^^ 

TOSTXJTTTCGAGACCGTGAGG^GGACCCC^GCTTCATGCCCG^ 

GGACCTTTGT ACC<^CTGCCTGCCRKTGGG^^ 

6190 CTCsTCTCTACrAAAMTACAACAACMCAAAAA 
CTTGTAGTCCCAGCTACTCGGGAGGCT^ 
TG^GCCTTQGrGACAONGTAAGACTCrATCTOW 
GC^CTCATATTCTATXTTCCCCCACT^ 
GGGCTCTTGGTGTCAGTGC^^ 
[A,C] 

CCCTGTCCATTCGGACCTGGGCAGTUrCCGMCaTTCTGGGCTmXIGGCCCCAGGGAG 

TGCTXBAGGCTCAAAMGCAGAGMGCrrCTGGMGATTTTTTGGAQGA 

GCAGGAAGAGGAGCCGCCTCAG^ 

CTGAGAAGGCCriTCCrGTL I GC 1 1 CI ICC(^CA(^GGaTTCrcrCCCCrCTCAGAGCr 
ACAAAACTT AAGCMGATTTTAMCTCTAAGCCrCAATTTCTTCATCTTT ACAATGGGGA 

6196 CTACT AAAAATACMCAACAACAAAAAAAGGTAGCTGG^ 
TCCCAGCTACTCGGGAGGCTGAG^^ 

CTTGCTGACACAGTAAGACTCTATCrCAAAAAAAAAAAAAAAAAAAAG^ 
ATATTCTATGTCCCCCACTCTGGAC^^ 

TCGTXjrovGrecreQGcrccreGcrecAQ^ 

[T,C] 

CCATTCGGACCTGGGCAGTGTCCGAACCTTTCrGGGCrGGGCGGCCCC^GGGAGTGCTGA 
GGCTCAAAMGCAGAGMGCrrCTCGAAGATTTTTT^ 

AGAGGAGCCGCCTCAGGTOTGGACAQGTGAGGGGTTTTCAGATCCAGT CGTGTT CTGAGA 
AGGCCTTTCCTCTL I (jC I I CI I CCCACACAGGCTTTCrCTCCCCTCrCAGAGCrACAAAA 
CTTAAGCAA^TTTTAMCTCTAAGCCTCM 

7403 TTAMTTTMTTTMTTTMTTTTATTTTTTTATCT A I I I I I 1 1 I 1 1 1 AGACAGAGTCTC 
GG^CTCrrTACCCAGGCTGGACJTC^ 
CTOGCTCAAGCCAT TCTCaT^ ^ 

CACCACGCCCQGCTAAI 1 1 1 1 1 1 G I AAAGGTGAGGTTGTGCCATGTTCCCCAGGCTGGTC 

TOWVCTCCKWVCTCAAGTCATaX^aXKICrreGC 

[T,G,C] 

CATAAGCCATTCrGCATGCGTAGCCrCCrrACrrGATTATrGGCTTTlT^ 
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GGCTGTGAGTCXZATGAGAGGAGGACC I b I lb 1 1 CI I GCTCCCAGCTCTCrCCCCAGGGGC 
AGGM(^CACAGATTAGTTTGOXjAATMTTGCA^^ 
ACCCATCTXaTATTTGATCTGATCCACATCACAAAAGCA^ 
CrrMTCTCCAGATTATAGVVGGCCAC^ 

9981 CCTCCTCAGT A GCTAAGACT ^C ACTTCCACCATXHAGrTTAGMGAAAGTAGATGACCAC 
CATGXTCATCTATTTTATTTTMCAAC^^ 

TTCAGACATATACAAAAGTAGAGAGAATAGT ATGMGAACATTCAGACATCCATCACCTA 
TCATCAACGATGATCAATTTCACAAAAAAATATT^ 

GGGCTTATGTCAATTCATACATAAA I G I I 1 1 GGGTACACATGTCTGACAAGAGGCTTACT 
[C,T] 

1 1 1 I 1 1 1 I 1 1 1 1 1 I 1 1 I GAGACGGAGTTTGGCR. I IGI I GCCCAGGCTCGAGTCCAATGG 

CAGGATCTCAGXTCACCTCMCCTC^ 

CTCCG3ACTAGCTGGGATTACAGGCGTGCACCACCAC 

GTAGAGAGGGGGGTTTCTCCA'I G I I IGICAGGCTGGTCTCGAACTCCTGACCTCAGGTGA 
TCCGCCCACCTTGGCCTCCCAAAG I G I I (jGTATTACAGGCGTGAGCCATGGOGCCCGGCC 

9998 ACXACACTTGX^CCATCTAGTTTAG 

TTTTMCMCrrTATTTTGGGTTCAC. I I I 1 1 GCTATGGAAAATTTCAGACATATACAAAA 
GTAGAGAGAATAGTATGAAGAACATTCAGACArCCATCACCTA^ 
TTTCACAAA AAAATA TTTTCAGGATCATTTTAAMCAAATCCOGGG 
TACATAMTGTTTTGGGTACACATGTCTGACAACAGGCnAL I I I I 1 1 I I I I I I I I I I I I 
[-,T] 

GAGAGQGAGTTTCGCTC I IGI I C^CCAGGCrGGAGTGCAATGGCAGGATCTCAGCrCACC 
TCAACCTCTGCITCCTXaGGTTCAAGTGATTCTCCTGCCn^ 

TTAO\GGCGTGCACCACCACACCCGGCTAATTTTG"A 1 1 1 I I AGTAGAGAGGGGGGTTTC 
TCCAIGI I I G I CAGGCTCGTCTCGAACTCCrG^CCTG^GGTGATCCGCCCACCrTGGCCT 
CCCAMGTGTTCGTATTACAGGCGTGAGCCATCGCGCCCGGCCCT^ I I I I IAI I I I I 

11050 ACTCCTXSACCCTCC^CTCAGCCTCCTCAGTAGCrGGGA 

ACCCAGCCCTAAAI mUTI IGACAGI IGI 1 1 CTGGCCAGGTG I IblGGCACATGCCTA 

TAGTCCCAGCTACnAGGATGCTGAGATQGGAGGATCTCnX^CTC 

CreCGGTGAGCTGTCAGCATX^CCCnKIACTCOVGGCGATAGAGC^^ 

TAG I IGI I I LI I CAAATTGAMTCCAMGATCTACTCMGGTATTTGG I IGI I IGCTTCT 

[T,C] 

I I I I 1 1 I I I I 1 1 I I 1 1 1 I I I I I I GAGATGX^GTCTCACTCTGrrGTCCAGGCrGGAGGGT 
AGTGX£GTGATCTTGXXTCACTCCMCCrC^ 

TCAGXIC rCCTGAGTAGCTGAGTTTACAGGTQZCCACCAACACGXICQ\G^ I I 1 1 I G TA 
TTTTTAGTAGTGAGGGXiiGTTTG^CCA^ 

AGTGATCTGCCCACATCGGCCrCCCG\AGTGTCGGGATTACAGACATCACT 

11772 C A I I 1 1 1 1 1 1 I C CAGGGCCCA G I I 1 1 I I 1 1 GGGGTGGGGGGAGGAGAGACAG I I IU IG C 
TCTGTCACCCAGGCTTGGAGTTCCT^ 

GGGCTGV\GAGATCCTCCCT CCTAAGCCTCT1T^GTAGGTGGX^CAGCAGGTGTGG\TG\ 

GXy^TGCGCAACrTTAAAAA I 1 1 I 1 1 IATGTAGACAT 

[T,A,G] 

GGXn"CTCACTACGCCGXICCAGGCnxn"CTCAMCrCCTGGTCT^ 
TCMCCTCCAAMGirXrTGGG^CTATAGGTGTGCCCAGCCCAG^ 
TMTATTT TGXAAT GXICACCTGTCCrMTTCMGATGAAA 
CAAAGATTATTTTAAMTMTAGTATTCGGGCAGGG 

12797 CACAGTAGAGTCTCACGGTGXTGTTGTACT^ 
AGGGCACraTTCAGGCAMG^CAGX^ 
TGMCAACATTTGCAATTTGCAAAC^AAACAMGCCCAGACT^ 
TTG6AGCCAAAAAAAI I IGIGI I I ATGAAOVWVrAGTCAGGTTCT AGGTGCATATTATT 
GCAAAC A I bl 1 1 I IL I I I I GI I I I I GI I 1 1 I G I I IGI I IGI I I G I I I I GI I I IG I 1 1 I G T 
[T,-] 

TTTTGAGATGX^GTCTCGXTCTCT^ 

ACTGCAAGCTCCGCCTCGXICGGTTCACGCC^TTCrCCTGCCTCAGCCrCCT 
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GGACTACAGGCGCCCGCCACCACGCCTGGCTAA 1 1 I I I ICTAI I I I I I AGTAGAGACGGG 
GTTTCACCATGTTAGXICAGX^TGCT^ 

CCrCCCAAAGTTCTGGGATTAC^GGGGTX^GCCACTGCCCCCGGCCI lUIIIUIIICT 

12801 CTAGAGTGTCACGG I GC I G 1 1 G I ACTGACAGCAACAAGCACCMCGMCGCACAGGAGGG 
CACTCGTGAGGGW\GAG\GCAACATAGCi 1 1 (_ I GGGGACATCATTTTCCAAACTTOTGAA 
OVXOVTTTGXA^TTTGXAMCAAMC^^ 

AGCCAAAAAAAI I IG IGM 1 ATCAACAAMTAGTCAGGTrCTAGGTCCATATTATrGCAA 
ACATGI 1 I 1 1 < — III I <_ I 1 1 1 IGI I 1 1 IGI I IGI I IGI I IGI I I IGI 1 1 IGI I I IGI I I I I 

Cr.-] 

GAGATGGACTCTCGCTCTGTCCCCCAGGCT^ 

CAAGCTCCGCCTCGCCGGTTCACGCG\TTCrCCTGCCrCAGCCTCCTCGCT 
TACAGGCGCCCGCCACCACGCCreGCTAAl I 1 1 1 ICTAI 1 1 1 1 1 AGTAGAGACGGGGTTT 
CACCATGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCTACCCGCCTTGGCCT 
CCAAAGTGCTCQGATTAOVGGCGTXaAGCCACnSCCCCCGGC C I I C I I I I C I I I I I I I I I I 

12857 AGGGCACTOGTGAGGCAAAGACAGCMCATAG^ 

TGAACMCATTTGCMTTTCCAAAO\AAACAAAGCC 

TTGGAGCCAAAAAAA I I I GIG I 1 1 ATCAACAAMTAGTCAGGTTCTAGGTGCATATTATT 
GCAAA CATGI I I I ICI I I ICI I 1 1 IGI I I I IGI I IGI I IGI I IGI I I IGI I I IGI I I IGT 

1 1 1 1 1 GAGATGGACTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCATGATCT 
[T,C] 

ACTGCAAGCTCCGCCTCGCCGGTTCACGCC^^ 

GGACrACAGGCGCCCGCCACCACGCCTGGCTAA I I I I I I CTA I I I I I I AGTAGAGACGGG 
GTTTCACCATGTTAGCCAGGATGGTCTGG^^ 

CCTCCCAAAGlGOGG GATTACAGGOCnX^GCCACTGCCCCCGGC CI I CI I I I <_ III I C T 
I I I I 1 1 1 1 I I I 1 1 I I GAGACAAAGTCTOGTCACCCAGGCTAGAGTCCCGTCXXIGTGGAC 

17525 TCAGX^O^CTCTGTCCTTCCCTACCGCTCAGCMTGACCTTATCC 1 1 G I CCCTGGCGGGT 
TGCACGI I I I I LI I I CCTCTACTTCCTGXIGTTATAGTTGACTGTC^GTG^CTGXICCTATT 
TATTCACTC^GCAAMCACAAGflAGTCACAAAGAAAAGXTTTAOT 
GCAOVGGGTGGGMOWWW^TGTTC^ 
MTCIXTTCTCAAAACTCTOGCTTATTTATAGGCAAGAT^ 
[C,T] 

GCTGGI I I I lAGAGR^TCTAGMTGGATCCCTGGGCCTGGMCATTCTCCACCCCTCC 
AGGTTTGCATGCMCrrGCTCACT(ZACCTCCTTCTGCTCT 

CTGAGAGGCCrrCCCAGCCTCCATCATCCCCAAAACCACACAT L I GG I III I I G II G II G 
TTGI IGI I GTCGTCATCA I I IGI I I I I I IGI I ICI I IGI I IGI I I GTATTGAGACAGAGT 
CTCGCTCTCT<^CCO«SACTGGAGTGC^ 

17552 TCAGC^T^CCTTATCCTTGTCCCTGGGGGGTTGCAGG I I I I ICI 1 1 CCTCTACTTCCT 
GXZGTTATAGTTCACTGTCAGTGACT^ 

ACAAAGAAMGGTrACTTMGGCCAGACTCATAGCAO\GGGTGGGAAC^ 
CTGAGGACTTTACCrn^TMGCAAMCTAAAAMTGTCTCTCAAAACT 
ATAGGCAAGATTTAGATTCTCATTGCMTCAGGCGC I GG I I I I I AGAGTGAATCTAGAAT 
[G,A] 

GATCCCTGGGCCTXXSAAC^TTCTCCACCCCTC^ 
CTCCTTCTGGTCTCTGATTAAATGTCCCTGCCTCTG^GAGGCCTTCC 
CCCCAAAACCACACATCTGG I I I I I IGI IGI IGI IGI IGI I GTCGTCATCA I I IGI I I I I 
I IGI I ICI I IGI I IGI 1 1 GTATTGAGACAGACTCTCGCTCTCTCACCCAGACTGGAGTGC 
AGTGGCAGGATCTRXXT<^CT^^ 

17572 CTCCCTGGCGGGTTGCAGG I I 1 1 ICI I I CCTCTACTTCCTGCGTTATAGTTGACTGTCAG 
TGACTCCCCTATTTATTCACTCAGX^ 
GGCCAGACTCATAG^CAGGGTGGGMC^^ 
AGCAAAACTAAAAMTGTGTTJTCAAAACTCTGGCrrAT^ 

CATTGCAATCAGGOGCTGG I I 1 1 IAGAGTGAATCTAGMTGGATCCCTGXXKCTXSGAA'CA 
[T,G] 

TCTCCACCCCTCCAGGTTTGCttTGCM I I C I GGTCT CTGATTA 
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MTGTCCaT^CTCTGAGAGGCCrTCCCAGCCrC(^TG\TCCCCW^ACG\CACATCTGG 
I I ( 1 1 IGF IGI IGI IGI IGI IGTG7TCATCAI I IGI I I I I I IGI I ICJ I IGI I IGI I IGT 
ATTCAGACAGACrrCTCGOOXJT^ 

CTGCAACCTCC^CCTCCG\GGATCAAGCMTTCTCTCrGCCrCAGCCTCCC^TCT 

18846 CCTGCTGTITATCTCCTCCCTACCCTCATC 

AACCCCTGG L I I C I I I I I I I I 1 1 1 1 I 1 1 1 1 1 1 1 1 I I GAGATGGAG I I I PGGTC1 IG I PG C 

CCMGCTGGAGTCCMTGGlX^GATCTTQGCrCACT 

GCGATTCTTCTQCCTCAGCCTCCCGAGrAGCreQGA^^ 

AGCTACTTTTTTATATTCrTACnAGAMCAG 

[G,A] 

MCTCCTTACCr(^TCTlGATCCACCAGCCrCGGCCTCCCMGTGCT 
GAGCCATCQCACCrGGCCTACCACrcACI 1 1 1 GATTACTCAAAGCATCAAGGGTATATAT 
GATGGGXCTGCAGGXIATGGTTCC^^ 

TTTTTCTCTTCCCTCAGGAGGAGGCCACreAGGGATCCCAAGAAGAQG^ 
GCAGCCTGCCCTCAGTGAGTGA^ 

18871 TCATC^AGGTCGTCTGGC^ 

I I I I 1 1 I I 1 1 I GAGATGGAG I I I IGGTU IGI I GCCCMGXTGGAGTGCAATGGTGCGAT 
CTTGGCTCACTCCAACCTCCGCC^^^ 

AGTAGCrGGGATTACAGAAATATGCTACGATGCCCAGCTAG I III t I ATATTOTAGTAG 

AMCAGAGTTTCACTCTGTTAGXICAGX-^ 

[T,C] 

AGCCTCGGCCTCCCMGTGCTGGGATTACAG 
GACTTTTGATTACTOW*G(^TGAA 

AGGAATTGTCCAAGGAGACCCCAGACGPGGCTCAG I I I 1 1 CT CTTCCCTCAGGAGGAGGC 

CACTGAGGGATCCCAAGAAGAGGACAACACCCCAGGCAGC^^ 

GAGTTTGGGATGGGXAGAAGTGGGACCrrc^^ 

18872 0\TCCAAGGT GGTCTCGCrrcrAGAGTGGGCCrrMCCCCJGG (- 1 I (J 1 1 1 I 1 1 I I I I 1 1 
I I I 1 1 1 I 1 1 I GAGATGGAG I I I IGGTLI IGI I GCCCMGXTTGGAGTGXAATGGTGCGATC 
TTGGCTCACTCCAACCTCGGCCTC^^ 

GTAGCTGGGATTACAGAAATATGCTACCATGCCCAGCTA G I I I I I I A TATTCTTAGTAGA 

MCAGAGTTTGACTCrGTTAGXIGAGGCT^^ 

[C,A] 

GTCTCGGCCTCCCAAGTGXTGX^TT^^ 

AC I 1 1 1 GATTACTCAAAGCATGAAGGCTATATATGATGGCTCT^ 

GG^TTXnCCAAGGAGACCCCAG^CCraGCTCAG I I I I I CTCTTCCCTCAGGAGGAGGCC 

ACTGAOK^TCCCAAGAAGAGGACMCACCCCAGGCAGCCT^ 

AGTTTCGGATGGGGVVCAAGTGGGACCrrCAGGGAGGG I IGI GGATG&TGATGGGGTCAGT 

19600 GACCCCTGACCCTTGACCCCT^ 

MTATa^CCCTTCATCCTACCTCT^^ 

GWKTCCTGACATCCCAGXXCC^^ 

ATTCAGACACACTTTGCCCCCAGGGGGATCTCATTM 

GTGGAGGCT ACAGrTCGGTGTGGTGGTGCCTGAGGXrrGGGGGTGGGGCAGX^GCT 

[T,C] 

AGAGXO"CGTCCTCCAGGAGTGGGAGATCCTGGCCCGC7\TCCAGCAGCrG^ 
GTOW^GCTAGACCCTGAGCCC^^ 
AGCTCACCGAGGAGCAGAGGTGACCACCCTGTAGCC^ 
CCTGXCTCACCAACTGX^TTCCArc^ 

CCTCTTGAGCCCrcATCCCCCACrACMCCrGTGACCrrGX^GTATC^^ 

21013 TGAGTAACTGACTCCAAL I I I IAIGI 1 1 GACTGTCCAGXTTGACTATGACAACTGTGTCC 
TTTCrTTCTATATMCrGTGACCCTMCCATT^CCCCAA 
GACCCTCAaTTATTTTATTTATTTATTTATTTAT^ 

TTTGAGACAGAGTCTGGCTL I G I 1 1 CCCAGGXrTOGAGTGCAGK^GKy\TCTCX5GCrO\ 

CrGTAGCCCCOGCCTCCCAGXTTTGAAGGAATTCTACT^ 

[A,G] 
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ATTACAGGCGCGAGCTACCACACCTGGCTAA 1 1 I I 1 1 1 IGTAI 1 1 1 1 AGTAGAGACGGGG 
TTTCACTATATTGGCCAGGC I GCj I I I CGAACTCTTGACCCGAAGCAATCCTCTCGCCTCA 
GCCTCCOW^CTGCTAGGATTACAGGCGrGAGCCACreCACCCA 
TMTCCTGACCCAATTTGATTCCTTAGTGCCACCC^^ 
CCAGCCATCCTUTCCCATCTCnXaATMGACCnXaATXK^ 

22055 GAGTCAGG^CAGCG^CCCCACCTCAGCCTC^^ 

CCCCCCrCAAGTCCTAGAAGCAGAGATGCTCCrecreGCAGTCCCCOQ 

ccccagggccccac£accaaggtaccaagacggcttgtgtgt^^ 

acccaggctcrgtgtgtgtgcacgtgtgtgtgc^ 

gggccaggagtgtatac^ggaggcacactgagcgc^ 

[A,G] 

TGCATCTGCACGGCCL I G I 1 1 OGGTGATCACTCATAMTCCGACTCGTGCTCAGATTTGG 
ACCTGTGTMCTGCTTGCCCATGGGrCATCrAGGGTGCAATCACATCACACCCL I II I I A 
TTTGAAACAGGGTU ICI I GCTCrGTCACCO\GGCn^GTGCAGCGGTGCAATCTCAGC 
TCACGGCMCTTCCACCCCTCCCC^^ 

TAGCTAGGACCAG^GGTrnXjCACCACCATGCCCTGCrA I 1 1 1 1 I I IATTTAGTAGAGATG 

22112 TCACCCCCCTCAAGTCCTAGAAGCAGAGATGCTC^ 
GGGCCCCAGGGCCCGAGCACCAAGGTACCAAGACGGC riGTGTGl GCATGCGGGCCTGCG 
GGCACCOVGGCTCTGTXnxnTGTXKAGGTGTGTCTGC^ 
GTCGGGCCAGGAGTGTATACAGGAGGCACACTCAGGGCCGGGGGTATCCATC 
TGCATGCATCTGCACGGCCCTGTTTGGGT^ 

[T,C] 

TGGACCTGTGTMCTGCTTGCCCArGGGTC^^ 

TTATTTGAAACAGGGTCI ICI I GCTCTGTCACCC^GGCreAAGTGCAGCDQGTGCAATGrC 
AGCTCACCGCMCTTCCACCCCTCC^ 

MGTAGCTAGGACCACAGGTGTCCAGCACCATCCCCreCTA I I 1 1 I I I I ATTTAGTAGAG 
ATGAGGTTTCGCCATGTTCCCCAGGTGGGTT^^ 

22113 TCCTGCTGGCAGTCCCCCGGCCTCTCCAGGGCCCCAGGGCCCCAGCACGV\GGTACCAAG 
ACGGC rPGTCTCl GCATGCX5GGCCTGCGGGCACCCAGGCTCTGTGTGTGTGCACGTOT 
GTXKATGCACATGTGTACAt^CAGGATTGTGGGGCCAGGAGTGT ATACAGGAGGCACACT 
GAGCGCCCGGGGTATCCATCCAGGGGATTGGATGCATGrGCACGGCCL I G I I IGGGTGAT 
CACTCATAAATCCGACTCGTGCTCAGATT 

[T,C] 

GGACCTGTGTAAL I GC I I GCCCATGGGTCATCTAGGGTGCMTCACATCACACCCCTTTT 
TATTTGAAACAGGGTL I I (J I GCTCTGTCACCCAGGCTGAAGTGCAGCGGTGCAATCT CA 
GCT(^CCGCMCTTCCACCCCTCCCCCAGGCrCAAGCAATCCTTCCACCTCAGCCTCCCA 
AGTAGCTAGGACCACAGGTGTGCACCACCATGCCCTGCTA 1 1 I I 1 1 1 1 ATTTAGTAGAGA 
TGAGGTTTCGCCATGTTGCCCAGGTGGGT 

22221 GCGGGCCTGCGGGCACCCAGGCTCTGTGTGTGTGCACGTGTGTGTGCATGCACATGTGTA 
CACACAGGAT7GTGGGGCCAGGAGTGTATACAGGAGGCACACTGAGCGCCCGGGGTATCC 
ATCCAGGGGATTGCA7GCATCTGCACGGCCL I G I I I GGGTGATCACTCATAAATCCGACT 
CGTGCTCAGATTTGGACCTGTGTMCTGCTTGCCCATGGGTCATCTAGGGTGCAATCACA 
TCACACCCCI I 1 1 I ATTTGAAACAGGGTL I ICI I GCTCTGTCACCCAGGCTGAAGTGCAG 
[C,T] 

GGTGCAATCTCAGCTCACCGCAACTTCCACCCCTCCCCCAGGCrCAAGCAATCCTTCCAC 
CrCAGCCTCCCMGTAGCTAGGACCACAGGTGTGCACCACCATGCCCTGCTA 1 1 1 1 1 1 1 1 
ATTTAGTAGAGATGAGGTTTCGCCATGTTGCCCAGGTGGGTTTCGMCTCCTG^ 
ACAATGC^CTCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCGCACCC 
AGCCTACAL I I I I I I GAGGACATGTATGTCCCTMGMTCTGCATACCATGGCAGACACG 

22360 TCTGCAOQGCCCTGTTTGGGTGATCACTCATAAATCCGACTCGTQ 

GITGrMCTGCTTGCCCATGGGTCATCTAGGGTGCAATCACATCACACCCG I I I I IATTTC 
AAACAGGGTCI ICI I GCTCTGTCACCCAGGCrGAAGTGCAGCGGTGCAATCTCAGCTCAC 
CGCMCTTCCACCCCTCCCCCAGGCTCMGCMTCCTTCCACCTCAGCCTCCCAAOT 
TAGGACCACAGGTGTGCACCACCATGCCCTGCTAI I I I 1 1 I I ATTTAGTAGAGATGAGGT 
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[T,C] 

TCGCCATCTTCCCOVGGTGGCTTTCGAACT 

CrCCCAMCTGCroGGATTACAGCnXjTXjAGCCACCGOVCCOVGCCTAQ^ L I I 1 1 I I G AGG 
ACATGTATGTCCCTMGMTCTGCATACCA^ 

CCGGCACTTTGGGAAGCCAAAGTGGGAGGA I IGU I GAGGCCGGGAGTT CAAGACCAGCC 
TGGGCAACATAGTX^GACCCrATTTCTATTAAMGTCAAAAAMTTAOT 

22543 MCTTCCACCCCTCCCCCAGGCT^ 

GACCACAGGTGTGCACCACCATGCCCTGCrA I I 1 1 I I I I ATTTAGTAGAGATGAGGTTTC 
GCCATCTTXKICCAQGTQGGTTTCGAACTCCrGAGCrCAM 

CCCAAAGKaCTtX^TTACAGGTCreAGCCACCGCACCCAGCCrACA L I I I I I IG AGGAC 

ATGTATGTCCCTAAGAATOX^TACC^^ 

[G,A] 

GCACTTTGGGAAGCGW\GTCGGAGGATTGCrTGAGGCCGGGAGTTG\AGAC 
GCAAO^TAGTX^GACCCTATTTCTATTAAAAGTCAAAA^ 
GCTACTCAGCAGGCrGAGGTGGAAGGATCGCTR^GTTTGAGGCT 
CATGCCACGGCACTCTAGCCTGCATG^^ 

CTGGGCAGGGTGGCTCAC3GCCTX3TMTCCCAGCACrCTGGGAGGCCGA 

23751 TTCATGAGATCATGTGTGGGG I IGI I CATTGGCATGGGCTGTQGGTGTATAACCGCTGTC 
AGCATATGTATGTACACAGGA I I I CI I GTGTATGAGCATGGGTTGTGTGTATATGGACAC 
TGTTCATGTC I G I I I CTATAACAGGTMCCAAAGTC7GTATATGGTAGGGTGGTCTATAT 
GCAGGCTTGTGAATGTACTCCAGTTGCATGTCCCAGGCrCTGCATGTG^ 
GTATGI I I I <_ 1 1 GAGATTTTATTTTATTTTATTATTTATTTATTTA I I I I IGAGATCGAG 
[A,T] 

CTTGCTCTGTCAGGCAGGCrcGAGTGCAGTAG^ 
CTCTGVVGTrOVAGTGATTCTCCnKlCrCGGCCrCCCAACT 

CGCCACCAGGCCCTGCTAAI I I I IGTAI I I I I AGTAGAGACGGAGTTTCACCACGTTGGC 
CAGGCTGGTCTTGAACTCCCGACCTCAAGTGATCCGCCCACCTCGGCCT CCCAAAATGCT 
GAGATTACAGGCATGAGCCACTGOGCCCAGCCAATG I I I I C_ 1 1 GAGATTTTAAATGTGGG 

23764 GTGTGGGG I I G I I CATTGGCA7GGGCTG7GGG7GTATMCCGCT 

ACACAGGAI I IU IGIGI ATCAGCATGGGI IGIGI GTATATGGACACTGTTCATGTCTGT 
TTCTATMCAGGTMCCAMGTCTUTATATGGTAGGGTGGTGTATATGCAG^ I IG I G AA 
TGTACTCCAGTTGCATGTCC<^GGCTCTGCATGTGTAGGGGGTAGTAGTAT GI I I ICI IG 
AGATTTTATTTTATTTTATTATTTATTTATTTA I I I I IGAGATGGAGTCTTGCTCTGTCA 
[C,T] 

GCAGGCTGGAGTGCAGT AGCIGTGATCTPGGCKACTXK^NACCTCTGCCTCTCAAGTTCAA 
GTGATTCTCCTGCCTCGGCCTCCCAAGT AGCTGGGATTACAGGCATGCGCCACCAGGCCC 
TGCTAAI I 1 1 IGIAI I I I I AGTAGAGACGGAGTTTCACCACGTTGGCCAGGCTGGTCTTG 
AACTCCCGACCT CMGTGATCCGCCCACCTCGGCCTCCCAAMTGCTX^GATTACAGGCA 
TGAGCCACTGGGCCCAGCCAATG I I I I CI I GAGATTTTAAATGTGGGGCTATTGAATGCA 

23782 GCATGGGCTGrreGGTGTATMCCGCTGTCAGCATATGTATGTAC^ I MCI IGTGT 
ATGAGCATGGG I IGIGIGI ATATGGACACTGTTCATGTCTGTTTCTATAACAGGT AACCA 
MGTCTGTATATGGTAGGGTGGTGTATATGCAGGCTTGTGMTGTACTC 
CCCAGGCrCTGCATGTGTAGGGGGTAGTAGTATG 1 1 I ILI I GAGATTTTATTTTATTTTA 
TTATTTATTTATTTA I I I I IGAGATGGAGTCTTGCTCTGTCACGCAGGCTGGAGTGCAGT 
[G,A] 

GCGTGATCTTCGCTCACTGCAACCTCTGCCTCTCM 

CCTCCCAAGTAGCTGGGATTACAGGCATGCGCCACCAGGCCCTCCTAA I I I I IGIATTTT 
TAGTAGAGACGGAGTTTCACCACGTTO^CAG^ 

ATCCGCCCACCTCGGCCTCCCAAAATGCTGAGATTACAGGCATGAGCCACTG^ 
CAATGI I I ICI I GAGATTTTAMTGTGGGGCTATTGAATGCACCAGTGGTGGCTQSGGTG 

24593 TCGAAGCATCTTGGTGAGGGGCTGGGCTGGG 

GCCGGGGCCCTCACCTCACCTCCCGCCCCTCTCTTCCAGCTGACCAGTCAGGACA^ 

CCCAGCGTGGTCCGGCGAGCCrTGCAGAAGCACAATGTGCCCCAGCCCTG 

TATCAGCTCTTTCAAGTCCTTCCTGGGGACCGGGGTGAGCAGGGATGGG^ 
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GATAGGGGGG\GCGGGG^GGCGAGO\GACTGACCACGCCCAAGGATGGAGCCG\AGGTTA 
[C,T] 

CCGGGTTCACAGGGCPGTGAGCj I G(_ 1 1 CAGGCAGAGAGTAGGGGTAAGATMTCAGTGGA 

GGTAAGAGGA^TAAMTACCTOTAACCG^CGATGTAGGGrC^TGAGATTGTCTTG^ 

CAG7UTGAGAGAGAGGTACCAMGGTCATCTC 

CCAGGGTCCGGCCAGGCGCAGTCGCT^ 

CTGGCKXXATCACTTGAGGTCACX^ 

24673 TCC0GCCCCTCTCTTCO\G^ 

CTTXKAGAAGCACMTCTGCCCC^^ 
TCCTCGGGACCGGGGTT^GCAGGGATCGGTTCG^ 

CGAGCAGACTXy\CCACGCCCAAGGATlXHAGCCGV\GGrrACCCGGGTTCA(^GGGCTGTG 

AGGTCCTrOVGGCAGAGAGTAGGGGTAAGAT^^ 

[C,T] 

CTCTAACCCAACQ\TGTAGGGTCATCAGA I IGIU I GGeTCAGTGTX^GAGAGAGGTACC 

AAAGGTCATCXTCCTAAMTTTAAAAGACAATAAGATT^ 

GTCGCTCATGTOXnAATCCCAGCACTTO 

CAGGAGTTTGAGACCAGGCTGACCAACG^^ 

MTTAGTCGGGTTGTCGTCGCA^C^^^ 

24730 AGCCTTGCAGMGCACMTGTXXXCC^^ 

CCTTCCTGGGGACCGGGGTGAGC^GGGATGGGTTGGAGCrCAGGATAGGGGGCAGCGGGG 
AGGCG^GC^GACreACC^CGCCCAAQG^TGGAGCCCAAGGTTACCCGGGTTCAC^GGGCr 
GTGAGGTGCITG^GGCAGAGAGTAGGGGT^ 

ATACCTGTAACCCMCGATGrAGGGTC^TGAGATTGTCTTGGCrCAGrGTGAGAGAGAGG 
[T,C] 

ACCAAAGCTCATCTTCCTAAMTTTAAMGACAATAA^ 

GCAGTGGCTCATGTCTTjTMTCCCAGCACr^^ 

GGTCAGG^GTTTGAGACCAGGCTGACCMC^^ 

AAAMTTAGTGGGGTXnGGTCGCACACXCC^^ 

AGGAGAATMTTGCXrcMTCTGGGAGGCGGAGGTTO^^ 

24750 T(XCCCAGCCCTGGGCCTGT^ 

AGCAGGGATG3GTTCGAGCTCAGGA 

CCCAAGGATGGAGCCCAAGGTTACCOGGGrrCACAGGGCTGTGAG b IGL I I C AGGCAGAG 
AGT AGGGGT AAGATAATCAGTTX^GGTMGAGGAC^TAAMTACCKTrAACCCAACGATG 
TAGGGTCATCAGA I lULI I GGCTCAGTGTCAGAGAGAGGTACCAAAGGTCATCTTCCTA 
[A,G] 

MTTTAAAAGACAATAAGATTGTCOVGGGTCGGGCCAGGCGCA^ 

TCCCAGCACTTTGGGAGGTCAAGCTlQGGCGGATCACrrGAGGT 

GCTGACCMCGTGGTXWV\CCCCGTCrCTACGAAACATACAAAAATTAGTCGQ 

GGCACACTCOtnAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGMTM 

CTGGGAGGCGGAGGTTGCAGTGAGCOGAGATCATACCACTGCACTTC^ 

24764 GCCTXnX^CTATCAGCTCTTTCAAGTCCnXICTGGGGACCGGGOT 

GGAGCTCAGGATAGGGGGCAGOjGGGAGGCGAGCAGACT^CCACGCCCMGGATGGAGC 

CCAAGGTTACCCXXSGTTCAOVGGGGnGTGAGGTGCTTCAGGCAGAG 

MTCAGTGGAGGTMGAGGACATAAMTACCTCTAA 

TGTCTTGGCTCAGTGTGAGAGAGAG^ 

[A,G] 

TAAGATTGTCCAGGGTCCGGCCAGGGGCAGTGGCTCATGTCTGTAATCCC^ 

GAGGTCMGCTX^GGCGGATCACTTGAGGTCAGGAGTTTGAGACC^ 

TCAAACCCCGTCTCTAGGAAACATACAAAAATTACT 

GTCCCAGCT ACTCAGGAGGCTGAGGCAGGAGMTAATTGCTTGA^TCTCGGAGGCGG^ 
TTGCAGTGAGCCGAGATC^TACCACreCACrrCAGCCTGGGCAGCAGA^ 

24780 TCTITCMGTCCTTCCTGGGGACCGQGGTGAGCAGGGATGGGTTGGAGCTQ 
GGCAGCGGGGAGGCGAGCAGACTGACCACG^^ 
TCACAGGGCTUTCAGGTGCTTCAGGCAGAGAGrAGGGCT 
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AGGACATAAMTACCrGTMCCGV\CGATCTAGGGTC^TGAGATTGTCTTGGCrG\GTGT 

GAGAGAGAGGTACCAAAGGT<^TCTTCCTAAAA^ 

[A,T] 

CCGGC(^GGCGCAGTCGCT<^TGTC^ 
GATCACTTGAGGTCAGGAGTTTGAGAC^^ 

CGAMG\TACAAAMTTAGTCGGGTGTGGTGGCAC^CTCCrGTAGTCCC^GCTACTG^GG 
AGGCTCAGGCAGGAGAATAAI IU.I I (^AATCTCGGAGGCGG^GGTTGCAGTGAGCCGAG^ 
TCATACCACTCCACTTCAGCCl^^ 

24856 GK^GACTT^CCAOKCGA^GGATG^^ 

TGClTCAGGCAGA^GTAGGGGTMGATAATCAGTGGAQGTMGAGGACATAAAATACCr 

GT AACCCAACG^TXTTAGGGTCATGAGATTGTCTTOXTOVCTG^ 

AGGTCATCTTCCrAAMTTTAAMGACMTAAG^^ 

GGCTC^TGTCTUTMTCCCAGCACTTTGGGAGGTGV^GCTGGGOGGA 

[A,G] 

GAGTTTCAGACOVGGCTC^CCMG^^ 

TACTGGGCTtrnSGTiGGG^ 

ATMTTGCTTGWCTGGGAGGaKAGGTTGCA^ 

CAGCCTGGGOV^GVGAGGGAGACTOXJriTAAAAAAAAAAAA^ 

CGGAC^AGTGACAGAAGGGACTGrrTCTGACCrrCAATTTGTAGGATGGGCT 

26136 ATGGTO3Cr<^CGCCTCTAATC^^ 

TOW^GATCGAGAC^TCCrQGCGVACATCGTXyVAGCCTCGTCT 

AAMTTAGCCAQGCATGGreGCAQGQGCCTATAGrCCCAGCrACrCAG 

GAAGMTCACTTGMCCCAGGAGGTGAAGGTTGCAGTGAGC<^^ 

TCCAGCCTGGGGACAGAGTCAGACTCCGTCTCAAAAAAA^ 

[A,G] 

GTCGAGTCAGAGA^GGGAGTTJTAGAAGTTTGT 

I PG1 1 CAGGCrCAACCT^TGGGATGGGAGAGTWVGGGQVTCTXyaGCrT AG A 1 1 PGTGA 

GAACTAAbi rrcn i caccactcggaccctgaagttatctgmcttcggacgggagagagg 

CAMTGGATAGCCGGGGAAGCATCAGATTGTCCTGTCTGAOVGGG^ 
AGCGTATTCACGCTGAAGTACATGGCATCAGGTTGGCrGG^ 

26331 CCG\GGAGGTO\AGGTTGCAGTCAGCCMGAT^ 

GAGTGAGACTCC GTCTC AAAAAAAAAGA AAAM TA^ 

GGGAGTGTAGAAGTTTGTCTGATCTTAAG FTTGl AGCATCATGAGATTGTTGAGGCTCAA 
CCTGATGGGATGGGAGACTAAAGGGCATCTGGGCTTAGAT^ I I Id I I 

C^CC^CTGGGACCCTGAAGTTATCTGAACTTGGGACGGGAGAGAGGCAAATGGATAGCCG 
[C,G] 

GGAAGCATGAGATTGTCCTGTCTGACAGGGAG^GCMGGGAT^^ 

AAGT ACATGO^T^GGTTGGCTGGATATTAGGAAAGGATGCrreTGG I IGI ICAGGTGT 

TGAGTCnGAGGCCACAAGCTCGTGCAGGCTGGAAGTGGGAAGTTATTCAA 

ACAGCAGCATCGGATTGGCTGGGAGTCG I IGIGGGGGAGGGGTAGGGTGAGCAGGAAGTT 

GTTTCGOGGGGGGKXjTCTAGGGTGGrCTAAGTTTGC 

28362 CTGAGGCAGGAGAATTGCTTGAACCQGGMGGTGGAGGTTCC^GTGAG 
CACTGTACrCCAGCCTGGGTGACAGAGCAAGACCCrATCrCAAAAACAM 
AATGMCAMCAAAAAATTTTCTGAGrGTGGTCATATCAGACTCT 
GGAGGCTGAGCTXX^GMTCACCAGAGCCCTGGGAGGTTGMGCnXACT 
CTGGGCCCCTGCACTCCMCCTGGAGGACAGAGGGAGAT^ 
[A,G,C] 

TAAGAGCCCTMGAAAGGTGTTGAGTCGGGT^^ 

TCCTGTCTGGCCGGACACAGTGGCTCAGGAGGGTMTCCCA^ 

TMGAGGATTGCrTAAGTTCAGGAGTCCGAGAGCAGCCTGGGCAATACA^ 

TCTCACT AMTAMTAAATAMGGATCCTATCACACAAAG^GGGTTT AGGACTTCCTTCC 

CCAACAI 1 1 1 1 GGGGTGATATCCCTL I I I I CrACKTTATATATCGGAGAGTGACTAACre 

28745 GCTCAGGAGGGTMTCCCAGTACTTPGGGAGGCTGAGCT I I G il IAAGTTCAG 

GAGTCGGAGAGCAGCCTCGGCAATACAGTGAGATCACAT^ 
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GGATCCTATCACACAMGAGGGTTTAGGAGTCCTTCCCCAACA I I 1 1 1 GGGGTCATATG 
CCTCrrTTCTACTXjrATATATGG^ 

AMTAGCATCATTACCCATGAGTCMTAAGGGCTGTGAGGATGQGCCCl^ 
[T,C] 

CACCTTCTrCCTCTTCCTTntAC^G^^ 
GGGTTJnSGTGGCT^CATCTGTMTCCCAG^ 
GAGCCCAGGAGTTTOVGACCAGeCTT^^ 
TACAMGATTCGCOVGGCGOXJn^ 

GCAGGCGGATGVTX^GCTCAGGAGATCGAGACCATCCTCGCTAGGTGAAACCCCGTCrCT 
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